Abstract. In this paper we systematically investigate the connections between logics
with a finite number of variables, structures of bounded pathwidth, and linear Datalog 
Programs. We prove that, in the context of Constraint Satisfaction Problems, all these 
concepts correspond to different mathematical embodiments of a unique robust notion that 
we call bounded path duality. We also study the computational complexity implications 
of the notion of bounded path duality. We show that every constraint satisfaction problem 
CSP(B) with bounded path duality is solvable in NL and that this notion explains in a 
uniform way all families of CSPs known to be in NL. Finally, we use the results developed 
in the paper to identify new problems in NL. 



1. Introduction 

The constraint satisfaction problem provides a framework in which it is possible to 
express, in a natural way, many combinatorial problems encountered in artificial intelligence 
and elsewhere. A constraint satisfaction problem is represented by a set of variables, a 
domain of values for each variable, and a set of constraints between variables. The aim of a 
constraint satisfaction problem is then to find an assignment of values to the variables that 
satisfies the constraints. 

Solving a general constraint satisfaction problem is known to be NP-complete [?, 29].
One of the main approaches pursued by researchers in artificial intelligence and
computational complexity to tackle this problem has been the identification of tractable cases
obtained by imposing restrictions on the constraints (see [?]).

Recently [?] (see also [20]) it has been observed that the constraint satisfaction problem
can be recast as the following fundamental algebraic problem: given two finite relational
structures A and B, is there a homomorphism from A to B? In this framework, the problem
of identifying which restrictions on the constraints guarantee tractability is equivalent to
deciding for which structures B the homomorphism problem when only A is part of the
input, denoted by CSP(B), is solvable in polynomial time. In this paper we study this
framework.

One of the simplest examples of constraint satisfaction problems of this form is obtained
when B is fixed to be a k-clique $K_k$ for some $k \ge 2$. In this case, an instance of CSP($K_k$)
is a graph A, and the question of the existence of a homomorphism from A to B is
equivalent to deciding whether A is k-colorable.

Feder and Vardi introduced one general condition for tractability of CSP(B) that 
accounts for many of the tractable cases of CSP(B). More precisely, Feder and Vardi 
observed that for many polynomial-time solvable Constraint Satisfaction Problems of the 
form CSP(B) there is a Datalog Program that defines the complement of CSP(B). 

In order to illustrate this consider now the particular case when B is K2, that is, B is a 
graph with two nodes and an edge between them. As it has been observed before, CSP(K2) 
is the set of all 2-colorable graphs. It is well-known that a graph is 2-colorable iff it does 
not have odd cycles. The following Datalog Program asserts that the graph A = (V, E) 
contains an odd cycle: 

P(x,y) :- E(x,y) 
P(x,y) :- P(x,z),E(z,u),E(u,y) 
Q :- P(x,x) 

This Datalog Program has three rules. These rules are a recursive specification of two
predicates, P and Q, called IDBs. Predicate P(x, y) holds whenever there exists a path in
A of odd length from x to y, and predicate Q, which acts as the goal predicate, holds if there
exists a cycle of odd length.
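As an illustration, the three rules above can be evaluated bottom-up by a simple fixpoint computation. The following Python sketch is not part of the original development: the names `has_odd_cycle` and `undirected`, and the encoding of E as a set of ordered pairs, are assumptions made for the example. An undirected graph is encoded by listing each edge in both directions.

```python
def undirected(pairs):
    """Encode an undirected edge list as a symmetric set of ordered pairs."""
    return {(x, y) for (a, b) in pairs for (x, y) in ((a, b), (b, a))}


def has_odd_cycle(edges):
    """Evaluate the goal predicate Q of the three-rule program on E = edges."""
    E = set(edges)
    succ = {}
    for (x, y) in E:
        succ.setdefault(x, set()).add(y)

    # Rule 1: P(x,y) :- E(x,y)
    P = set(E)
    frontier = set(E)

    # Rule 2: P(x,y) :- P(x,z), E(z,u), E(u,y)
    # The rule is linear (one IDB in the body), so it suffices to
    # extend only the facts derived in the previous iteration.
    while frontier:
        new = set()
        for (x, z) in frontier:
            for u in succ.get(z, ()):
                for y in succ.get(u, ()):
                    if (x, y) not in P:
                        new.add((x, y))
        P |= new
        frontier = new

    # Rule 3: Q :- P(x,x)
    return any(x == y for (x, y) in P)
```

For instance, under this encoding the triangle makes Q true, whereas the 4-cycle does not.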

Since the seminal results of Feder and Vardi, the language Datalog has played a promi- 
nent role in the study of the complexity of the CSP. In particular, several connections 
between Datalog Programs and well-established notions developed in the area of constraint 
satisfaction problems and graph homomorphism have been explored. 

One of these notions is that of bounded treewidth duality, initially introduced in the area
of H-coloring, which can be reformulated as the constraint satisfaction problem CSP(H)
where H is a graph. In an H-coloring problem we are given a graph G as input and we
are asked whether there exists a homomorphism from G to H. It has been observed [?]
that the vast majority of the tractable cases of CSP(H) have an obstruction set of bounded
treewidth. An obstruction set of CSP(H) is any set S of graphs not homomorphic to
H such that for every graph G not homomorphic to H there exists a graph in S that is
homomorphic to G. Every graph H having an obstruction set of bounded treewidth is said
to have bounded treewidth duality. Bounded treewidth duality turns out to be equivalent to
definability in Datalog, as shown in [?]. More precisely, a graph H has bounded treewidth
duality if and only if the complement of CSP(H) is definable in Datalog. The relationship
between Constraint Satisfaction Problems, Datalog and structures of bounded treewidth
has been further investigated in [27], [23], and [?].

The goal of the present paper is to take a closer look inside the structure of Datalog
Programs. In particular, we are interested in linear Datalog Programs [?], which are Datalog
Programs in which every rule has at most one IDB in the body. Many constraint satisfaction
problems solvable with a Datalog Program are indeed solvable by a linear Datalog Program.
In fact, the Datalog Program defining NON-2-COLORABILITY presented above is a linear
Datalog Program. It is well known that problems solvable by a linear Datalog Program are
solvable in non-deterministic logarithmic space (NL). This class has received a lot of interest in
complexity theory. In particular, it is known that NL $\subseteq$ NC and, therefore, problems in NL
are highly parallelizable.

In this paper we embark on a systematic study of the relationship between linear Datalog
Programs, finite variable logics and structures of bounded pathwidth. We prove several
different but equivalent characterizations of definability in linear Datalog. Most of them
are in the realm of logic. In particular, we show that for every constraint satisfaction
problem of the form CSP(B), $\neg$CSP(B) is definable in linear Datalog if and only if $\neg$CSP(B) is
definable in the logic $M^\omega$, which is defined to be the subset of the infinitary logic
$L^\omega_{\infty\omega}$ in which only existential quantification, infinitary disjunction and a certain
restricted infinitary conjunction are allowed. Definability in linear Datalog is also shown to
be equivalent to definability in restricted Krom SNP, which is defined to be the set of all
existential second-order sentences with a universal first-order part in prenex CNF, in which
we require every clause of the first-order part to contain only negated relation symbols and
at most one positive and one negative second-order variable. We also obtain a combinatorial
reformulation of definability in linear Datalog. More precisely, we show that for every structure
B, the complement of CSP(B) is definable in linear Datalog if and only if CSP(B) has an
obstruction set of bounded pathwidth. All of these different but equivalent reformulations
of the same notion seem to provide some evidence that the class of problems definable in
linear Datalog is an interesting and robust class.

Finally we use our results to investigate which constraint satisfaction problems are in
NL. Despite the large number of tractable cases of constraint satisfaction problems identified
so far, very few subclasses of constraint satisfaction problems are known to be in NL. To
our knowledge, the only families of CSP problems known to be in NL are the class of
bijunctive satisfiability problems [34], including 2-SAT, which was later generalized to
the class of implicational constraints [22] (see also [20]); the class of implicative
Hitting-Set Bounded problems (see [?]), originally defined (although with a different name)
in [34]; and the class of posets with constants invariant under a near-unanimity operation [?].
First we observe that all these families of problems are particular cases of bounded path
duality problems. Consequently, the results in our paper provide a uniform explanation of all
constraint satisfaction problems known to be in NL. Finally, we identify some new
families of constraint satisfaction problems solvable in NL.

2. Basic Definitions 

Most of the terminology introduced in this section is fairly standard. We basically
follow [15]. A vocabulary is a finite set of relation symbols or predicates. In the following
$\tau$ always denotes a vocabulary. Every relation symbol $R$ in $\tau$ has an arity $r = \rho(R) \ge 0$
associated to it. We also say that $R$ is an $r$-ary relation symbol.

A $\tau$-structure $\mathbf{A}$ consists of a set $A$, called the universe of $\mathbf{A}$, and a relation
$R^{\mathbf{A}} \subseteq A^r$ for every relation symbol $R \in \tau$, where $r$ is the arity of $R$. Unless
otherwise stated we will assume that we are dealing with finite structures, i.e., structures
with a finite universe. Throughout the paper we use the same boldface and slanted capital
letters to denote a structure and its universe, respectively.

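Let $\mathbf{A}$ and $\mathbf{B}$ be $\tau$-structures. We say that $\mathbf{B}$ is a substructure of $\mathbf{A}$, denoted by
$\mathbf{B} \subseteq \mathbf{A}$, if $B \subseteq A$ and for every $R \in \tau$, $R^{\mathbf{B}} \subseteq R^{\mathbf{A}}$. If $\mathbf{A}$ is a $\tau$-structure and
$B \subseteq A$, then $\mathbf{A}|_B$ denotes the substructure induced by $\mathbf{A}$ on $B$, i.e., the $\tau$-structure $\mathbf{B}$
with universe $B$ and $R^{\mathbf{B}} = R^{\mathbf{A}} \cap B^r$ for every $r$-ary $R \in \tau$.

Let $\mathbf{A}$ and $\mathbf{B}$ be $\tau$-structures. We denote by $\mathbf{A} \cup \mathbf{B}$ the $\tau$-structure with universe
$A \cup B$ and such that for all $R \in \tau$, $R^{\mathbf{A} \cup \mathbf{B}} = R^{\mathbf{A}} \cup R^{\mathbf{B}}$.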

A homomorphism from a $\tau$-structure $\mathbf{A}$ to a $\tau$-structure $\mathbf{B}$ is a mapping $h : A \to B$ such
that for every $r$-ary $R \in \tau$ and every $(a_1, \dots, a_r) \in R^{\mathbf{A}}$, we have
$(h(a_1), \dots, h(a_r)) \in R^{\mathbf{B}}$. We denote this by $\mathbf{A} \xrightarrow{h} \mathbf{B}$. We say that $\mathbf{A}$
homomorphically maps to $\mathbf{B}$, and denote this by $\mathbf{A} \to \mathbf{B}$, iff there exists some
homomorphism from $\mathbf{A}$ to $\mathbf{B}$. We denote by $\hom(\mathbf{A}, \mathbf{B})$ the set of all homomorphisms
from $\mathbf{A}$ to $\mathbf{B}$.

We will assume by convention that for every set $B$ there exists one mapping $\lambda : \emptyset \to B$.
Consequently, if $\mathbf{A}$ is a structure with an empty universe then $\{\lambda\} = \hom(\mathbf{A}, \mathbf{B})$.

Let $a_1, \dots, a_m$ be elements in $A$ and let $b_1, \dots, b_m$ be elements in $B$. We shall write
$\mathbf{A}, a_1, \dots, a_m \to \mathbf{B}, b_1, \dots, b_m$ to denote that there exists some homomorphism $h$ from $\mathbf{A}$
to $\mathbf{B}$ such that $h(a_i) = b_i$, $1 \le i \le m$.

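Let $\mathbf{A}$ be a $\tau$-structure and let $\tau' \subseteq \tau$. We denote by $\mathbf{A}[\tau']$ the $\tau'$-structure such that
for every $R \in \tau'$, $R^{\mathbf{A}[\tau']} = R^{\mathbf{A}}$. Similarly, if $\mathcal{C}$ is a collection of $\tau$-structures we denote by
$\mathcal{C}[\tau']$ the set $\{\mathbf{A}[\tau'] : \mathbf{A} \in \mathcal{C}\}$. STR denotes the class of all structures and consequently,
STR[$\tau$] denotes the class of all $\tau$-structures.

Finally, CSP($\mathbf{B}$) is defined to be the set of all structures $\mathbf{A}$ such that $\mathbf{A} \to \mathbf{B}$.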
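To make the definitions of this section concrete, the following Python sketch enumerates homomorphisms by brute force and tests membership in CSP(B). It is an illustration only, exponential in the size of the universe of A. The representation of a structure as a pair `(universe, relations)` and the names `homomorphisms` and `in_csp` are assumptions made for the example, not notation from the paper.

```python
from itertools import product


def homomorphisms(A, B):
    """Yield every homomorphism from A to B as a dict.

    A structure is a pair (universe, relations), where relations maps
    each relation symbol to a set of tuples over the universe.
    """
    universe_a, rels_a = A
    universe_b, rels_b = B
    elems = list(universe_a)
    for values in product(list(universe_b), repeat=len(elems)):
        h = dict(zip(elems, values))
        # h is a homomorphism iff every tuple of A is sent to a tuple of B.
        if all(tuple(h[a] for a in t) in rels_b.get(R, set())
               for R, tuples in rels_a.items() for t in tuples):
            yield h


def in_csp(A, B):
    """Return True iff A belongs to CSP(B), i.e., A maps homomorphically to B."""
    return next(homomorphisms(A, B), None) is not None
```

When the universe of A is empty, the enumeration yields exactly one mapping, the empty dictionary, which plays the role of the mapping $\lambda$ fixed by the convention above.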

3. Infinitary Logic 

The following definition is borrowed from [21]. Let $\tau$ be a vocabulary containing only
relational symbols and let $\{v_1, \dots, v_n, \dots\}$ be a countable set of variables. The class
$L_{\infty\omega}$ of infinitary formulas over $\tau$ is the smallest collection of formulas such that

• it contains all first-order formulas over $\tau$.

• if $\varphi$ is a formula of $L_{\infty\omega}$ then so is $\neg\varphi$.

• if $\varphi$ is a formula of $L_{\infty\omega}$ and $v_i$ is a variable, then $(\forall v_i)\varphi$ and $(\exists v_i)\varphi$ are also
formulas of $L_{\infty\omega}$.

• if $\Psi$ is a set (possibly infinite) of $L_{\infty\omega}$ formulas, then $\bigvee \Psi$ and $\bigwedge \Psi$ are also
formulas of $L_{\infty\omega}$.

The first subscript of $L_{\infty\omega}$ indicates that conjunctions and disjunctions can be taken
over arbitrary infinite sets and the second that only finite quantifier blocks are allowed.

The concept of a free variable in a formula of $L_{\infty\omega}$ is defined in the same way as for
first-order logic. We use the notation $\varphi(u_1, \dots, u_m, \dots)$ to denote that $u_1, \dots, u_m, \dots$ are
different and that $\varphi$ is a formula of $L_{\infty\omega}$ whose free variables are among the variables
$u_1, \dots, u_m, \dots$. A sentence of $L_{\infty\omega}$ is a formula $\varphi$ of $L_{\infty\omega}$ with no free variables. The
semantics of $L_{\infty\omega}$ is a direct extension of the semantics of first-order logic, with $\bigvee \Psi$
interpreted as a disjunction over all formulas in $\Psi$ and $\bigwedge \Psi$ interpreted as a conjunction.
If $\mathbf{A}$ is a structure over $\tau$ and $a_1, \dots, a_m, \dots$ is a sequence of (not necessarily different)
elements from the universe of $\mathbf{A}$, then we write

$\mathbf{A}, a_1, \dots, a_m, \dots \models \varphi(u_1, \dots, u_m, \dots)$

to denote that the structure $\mathbf{A}$ satisfies the formula $\varphi$ of $L_{\infty\omega}$ when each variable $u_i$ is
interpreted by the element $a_i$.

Let $k$ be a non-negative integer. The infinitary logic with $k$ variables, denoted by $L^k_{\infty\omega}$,
consists of all formulas of $L_{\infty\omega}$ with at most $k$ distinct variables. Let $L^k$ be the collection of
all formulas of $L^k_{\infty\omega}$ that are obtained from atomic formulas using infinitary disjunctions,
infinitary conjunctions, and existential quantification only.

Let $j$ be a non-negative integer. We will denote by j-restricted infinitary conjunction
the infinitary conjunction $\bigwedge \Psi$ when $\Psi$ is a collection of $L_{\infty\omega}$ formulas such that (a) every
formula with more than $j$ free variables is quantifier-free and (b) at most one formula in $\Psi$
having quantifiers is not a sentence. If furthermore the set $\Psi$ is finite then we will call it
j-restricted conjunction.

Let $0 \le j \le k$ be non-negative integers. Let $M^{j,k}$ ($N^{j,k}$) be the collection of all formulas
of $L^k_{\infty\omega}$ that are obtained from atomic formulas using infinitary disjunction, j-restricted
infinitary conjunction (j-restricted conjunction), and existential quantification only. Finally,
let $O^{j,k}$ be the collection of all formulas of $L^k_{\infty\omega}$ that are obtained from atomic formulas
using j-restricted conjunction and existential quantification only. We also put

$$M^\omega = \bigcup_{0 \le j \le k} M^{j,k}, \qquad N^\omega = \bigcup_{0 \le j \le k} N^{j,k}, \qquad O^\omega = \bigcup_{0 \le j \le k} O^{j,k}.$$

The following example illustrates the expressive power of these logics. 

Example 1 (Paths, Bipartiteness). Assume that the vocabulary consists of a unique binary
relation $E$, and let $\varphi_n(x, y)$, $n \ge 1$, be the first-order formula asserting that there exists a
path of length $n$ from $x$ to $y$. The naive way to write $\varphi_n(x, y)$ requires $n + 1$ variables,
namely

$$\exists x_1 \exists x_2 \dots \exists x_{n-1}\; E(x, x_1) \wedge E(x_1, x_2) \wedge \dots \wedge E(x_{n-1}, y)$$

It is well-known that $\varphi_n$ is equivalent to a formula in $L^3_{\infty\omega}$ (in fact in $L^3$). To see this,
put $\varphi_1(x, y) = E(x, y)$ and assume, by induction on $n$, that $\varphi_{n-1}(x, y)$ is equivalent to a
formula in $L^3_{\infty\omega}$. Then

$$\varphi_n(x, y) = \exists z\,[E(x, z) \wedge \exists x((z = x) \wedge \varphi_{n-1}(x, y))]$$

A closer look at $\varphi_n(x, y)$ reveals that every conjunction used in the definition of $\varphi_n(x, y)$
is 2-restricted and hence we can conclude that $\varphi_n(x, y)$ is in $O^{2,3}$ (and hence in $N^{2,3}$ and
$M^{2,3}$).

Finally, let $\mathcal{C}$ be the set of all finite $\tau$-structures that, interpreted as graphs, are bipartite.
It is well known that a graph is bipartite if and only if it does not contain odd cycles. Therefore,
the complement of $\mathcal{C}$ is defined by the formula

$$\exists x \bigvee_{n \ge 0} \varphi_{2n+1}(x, x)$$
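The three-variable rewriting of $\varphi_n$ in Example 1 can be checked mechanically. The sketch below builds the formula as a plain string by iterating the inductive step and confirms that no variable other than x, y, z ever occurs. The ASCII syntax (`exists`, `and`) and the helper names `phi` and `variables` are assumptions made for this illustration.

```python
import re


def phi(n):
    """Return phi_n(x, y) as a string that reuses only the variables x, y, z."""
    if n < 1:
        raise ValueError("n must be at least 1")
    formula = "E(x,y)"  # phi_1(x, y)
    for _ in range(n - 1):
        # phi_n(x,y) = exists z [E(x,z) and exists x ((z = x) and phi_{n-1}(x,y))]
        formula = f"exists z [E(x,z) and exists x ((z = x) and {formula})]"
    return formula


def variables(formula):
    """Return the set of variable names among x, y, z occurring in the formula."""
    return set(re.findall(r"\b[xyz]\b", formula))
```

Each inductive step adds two quantifiers but no new variable name, so the formula for a path of length n has 2(n - 1) quantifiers over three variables, in contrast with the n + 1 variables of the naive formula.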

As we shall see later (Theorem 1), the expressive power of these logics is the same
regardless of whether or not we allow the use of equalities (=) in the formulas. By convention we
shall assume that, unless otherwise explicitly stated, formulas do not contain equalities.

We finish this section by stating without proof a very simple fact about these logics 
that will be used intensively in our proofs. 

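Proposition 1. Let $\mathbf{A}$, $\mathbf{B}$ be $\tau$-structures such that $\mathbf{A} \to \mathbf{B}$. For every sentence $\varphi$ in
$L_{\infty\omega}$ that does not contain universal quantification nor negation such that $\mathbf{A} \models \varphi$ we have
that $\mathbf{B} \models \varphi$.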

4. Quasi-Orderings and Pathwidth of Relational Structures 

A quasi-ordering on a set $S$ is a reflexive and transitive relation $\le$ on $S$. Let $(S, \le)$
be a quasi-ordered set. Let $S'$, $S''$ be subsets of $S$. We say that $S'$ is a filter if it is closed
under $\le$ upward; that is, if $x \in S'$ and $x \le y$, then $y \in S'$. The filter generated by $S''$ is
the set $F(S'') = \{y \in S : \exists x \in S''\ x \le y\}$. We say that $S'$ is an ideal if it is closed under $\le$
downward; that is, if $x \in S'$ and $y \le x$, then $y \in S'$. The ideal generated by $S''$ is the set
$I(S'') = \{y \in S : \exists x \in S''\ y \le x\}$. Observe that every subset $S'$ of $S$ and its complement
$S \setminus S'$ satisfy the following relation: $S'$ is an ideal iff $S \setminus S'$ is a filter.

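Let $\mathcal{I}$ be an ideal of $(S, \le)$. We say that a set $\mathcal{O} \subseteq S$ forms an obstruction set for $\mathcal{I}$ if

$x \in \mathcal{I}$ iff $\forall y \in \mathcal{O}\ (y \not\le x)$

That is, $\mathcal{O}$ is an obstruction set for $\mathcal{I}$ if $\mathcal{I}$ is the complement of $F(\mathcal{O})$.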

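Let $\tau$ be a vocabulary. The set of $\tau$-structures, STR[$\tau$], is quasi-ordered by the
homomorphism relation. In consequence, a set $\mathcal{C}$ of $\tau$-structures is an ideal of (STR[$\tau$], $\to$)
if

$\mathbf{B} \in \mathcal{C},\ \mathbf{A} \to \mathbf{B} \implies \mathbf{A} \in \mathcal{C}$

Observe that for any relational structure $\mathbf{B}$, CSP($\mathbf{B}$) = $I(\mathbf{B})$.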

Let us define a notion of pathwidth relative to relational structures, which is the natural
generalization of the notion of pathwidth over graphs, introduced by Robertson and Seymour
[53]. We follow the lines of previous generalizations of similar notions such as treewidth.
For reasons that will be made clear later it is desirable to parameterize the ordinary notion
of pathwidth to capture a finer structure. For this purpose we will consider not only
the maximum size of any set of the path-decomposition but also the maximum size of the
intersection of two consecutive sets.

Definition 1. Let $\mathbf{A}$ be a $\tau$-structure. A path-decomposition of $\mathbf{A}$ is a collection $S_1, \dots, S_n$,
$S_i \subseteq A$, such that:

(1) for every $r$-ary relation symbol $R$ in $\tau$ and every $(a_1, \dots, a_r) \in R^{\mathbf{A}}$, there exists
$1 \le i \le n$ such that $\{a_1, \dots, a_r\} \subseteq S_i$.

(2) if $a \in S_i \cap S_j$, then $a \in S_l$ for all $i \le l \le j$.

The width of the path-decomposition is defined to be the pair
$(\max\{|S_i \cap S_{i+1}| : 1 \le i \le n - 1\}, \max\{|S_i| : 1 \le i \le n\})$. We say that a structure $\mathbf{A}$ has
pathwidth at most $(j, k)$ if it has a path-decomposition of width $(j, k)$.
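The two conditions of Definition 1 and the resulting width can be checked directly. The following Python sketch is an illustration under assumed conventions: relations are given as a dictionary from relation symbols to sets of tuples, the sets $S_1, \dots, S_n$ as a non-empty list of Python sets, and the name `decomposition_width` is hypothetical. For a decomposition with a single set, the first component of the width is taken to be 0, which is an assumption of this sketch (the maximum in the definition then ranges over an empty set).

```python
def decomposition_width(relations, bags):
    """Return the width (j, k) of `bags` as a path-decomposition, or None
    if `bags` violates condition (1) or (2) of the definition."""
    # Condition (1): every tuple of every relation lies inside some bag.
    for tuples in relations.values():
        for t in tuples:
            if not any(set(t) <= bag for bag in bags):
                return None
    # Condition (2): the bags containing a given element are consecutive.
    for a in set().union(*bags):
        idx = [i for i, bag in enumerate(bags) if a in bag]
        if idx[-1] - idx[0] + 1 != len(idx):
            return None
    # First component: largest intersection of two consecutive bags.
    j = max((len(bags[i] & bags[i + 1]) for i in range(len(bags) - 1)), default=0)
    # Second component: largest bag (note that 1 is not subtracted).
    k = max(len(bag) for bag in bags)
    return (j, k)
```

For a path with edges (a,b), (b,c), (c,d), the decomposition {a,b}, {b,c}, {c,d} has width (1, 2), while the single set {a,b,c,d} has width (0, 4); the two components of the width can thus be traded against each other.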

The concept of pathwidth relative to relational structures introduced here is intended
to be a natural generalization of the notion of pathwidth defined over graphs [?]. However,
there are some points in which our definition is not standard. First, in the ordinary notion
of pathwidth over graphs, as defined in [?], the width of a path-decomposition is defined as
$\max\{|S_i| : 1 \le i \le n\} - 1$. We are interested in a more fine-grained analysis that motivates
the consideration, not only of the cardinality of the sets but also of the cardinality of the
intersection of two consecutive sets in the decomposition. Furthermore, it is convenient for us
not to subtract 1, as is customary, since this subtraction would then have to be carried
throughout the paper.

In order to delineate even more the precise relationship between the notion of pathwidth 
as it is usually defined over graphs and the notion of pathwidth of relational structures 
introduced in this paper we remark the following equivalence. 

Let $\mathbf{A}$ be a relational structure. Then the following numbers are equal:

• the pathwidth of the Gaifman graph of $\mathbf{A}$ plus one.

• the minimum $k$ such that $\mathbf{A}$ has pathwidth at most $(k - 1, k)$.

We say that a set of structures C has pathwidth at most (j, k) if every structure A in 
C has pathwidth at most (j, k) . 



LINEAR DATALOG AND BOUNDED PATH DUALITY OF RELATIONAL STRUCTURES 






5. Pebble-Relation Games 

In this section we will introduce a game, called the $(j, k)$-pebble-relation game, that captures
expressibility in $M^{j,k}$.

Let $S_1$ and $S_2$ be two (not necessarily finite) sets. A relation $T$ with domain $S_1$ and
range $S_2$ is a collection of functions with domain $S_1$ and range $S_2$. Remark: some confusion
can arise from the fact that generally (and in this paper) the name relation is used with
another meaning; for example, an $r$-ary relation over $B$ is a subset of $B^r$. Both concepts
are perfectly consistent, since an $r$-ary relation over $B$ is, indeed, a relation in our sense
with domain $\{1, \dots, r\}$ and range $B$.

Let $f$ be a function with domain $S_1$ and range $S_2$, and let $S_1'$ be a subset of its domain
$S_1$. We will denote by $f|_{S_1'}$ the restriction of $f$ to $S_1'$. Similarly, let $T$ be a relation with
domain $S_1$ and range $S_2$, and let $S_1'$ be a subset of its domain $S_1$. We will denote by $T|_{S_1'}$
the relation with domain $S_1'$ and range $S_2$ that contains $f|_{S_1'}$ for every $f \in T$. For every
relation $T$ we denote by dom($T$) the domain of $T$. We have two relations with domain $\emptyset$:
the relation $\{\lambda\}$ and the relation $\emptyset$.

Let $0 \le j \le k$ be non-negative integers and let $\mathbf{A}$ and $\mathbf{B}$ be (not necessarily finite)
$\tau$-structures. The $(j, k)$-pebble-relation ($(j, k)$-PR) game on $\mathbf{A}$ and $\mathbf{B}$ is played between two
players, the Spoiler and the Duplicator. A configuration of the game consists of a relation
$T$ with domain $I = \{a_1, \dots, a_{k'}\} \subseteq A$, $k' \le k$, and range $B$ such that every function $f$ in $T$
is a homomorphism from $\mathbf{A}|_I$ to $\mathbf{B}$.

Initially $I = \emptyset$ and $T$ contains the (unique) homomorphism from $\mathbf{A}|_\emptyset$ to $\mathbf{B}$, that is, $\lambda$.
Each round of the game consists of a move from the Spoiler and a move from the Duplicator.
Intuitively, the Spoiler has control over the domain $I$ of $T$, which can be regarded as placing
some pebbles on the elements of $A$ that constitute $I$, whereas the Duplicator decides the
content of $T$ after the domain $I$ has been set by the Spoiler. There are two types of rounds:
shrinking rounds and blowing rounds.

Let $T^n$ be the configuration after the $n$-th round. The Spoiler decides whether the
following round is a blowing or shrinking round.

• If the $(n + 1)$-th round is a shrinking round, the Spoiler sets $I^{n+1}$ (the domain of
$T^{n+1}$) to be a subset of the domain $I^n$ of $T^n$. The Duplicator responds by projecting
every function in $T^n$ onto the subdomain defined by $I^{n+1}$, that is, $T^{n+1} = T^n|_{I^{n+1}}$.

• A blowing round can only be performed if $|I^n| \le j$. In this case the Spoiler sets
$I^{n+1}$ to be a superset of $I^n$ with $|I^{n+1}| \le k$. The Duplicator responds by providing
a $T^{n+1}$ with domain $I^{n+1}$ such that $T^{n+1}|_{I^n} \subseteq T^n$. That is, $T^{n+1}$ should contain some
extensions of functions in $T^n$ over the domain $I^{n+1}$ (recall that any such extension
must be a homomorphism from $\mathbf{A}|_{I^{n+1}}$ to $\mathbf{B}$).

The Spoiler wins the game if the response of the Duplicator sets $T^{n+1}$ to $\emptyset$, i.e., the
Duplicator could not successfully extend any of the functions. Otherwise, the game resumes.
The Duplicator wins the game if he has a strategy that allows him to continue playing
"forever", i.e., if the Spoiler can never win a round of the game.

Now, we will present an algebraic characterization of the $(j, k)$-PR game.

Definition 2. Let $0 \le j \le k$ be non-negative integers and let $\mathbf{A}$ and $\mathbf{B}$ be (not necessarily
finite) $\tau$-structures. We say that the Duplicator has a winning strategy for the $(j, k)$-pebble-relation
game on $\mathbf{A}$ and $\mathbf{B}$ if there is a nonempty family $\mathcal{H}$ of relations such that:

(a) every relation $T$ has range $B$ and domain $I$ for some $I \subseteq A$ with $|I| \le k$.

(b) for every relation $T$ in $\mathcal{H}$ with domain $I$, $\emptyset \ne T$ and $T \subseteq \hom(\mathbf{A}|_I, \mathbf{B})$.

(c) $\mathcal{H}$ is closed under restrictions: for every $T$ in $\mathcal{H}$ with domain $I$ and every $I' \subseteq I$, we
have that $T|_{I'} \in \mathcal{H}$.

(d) $\mathcal{H}$ has the $(j, k)$-forth property: for every relation $T$ in $\mathcal{H}$ with domain $I$ with $|I| \le j$
and every superset $I'$ of $I$ with $|I'| \le k$, there exists some relation $T'$ in $\mathcal{H}$ with
domain $I'$ such that $T'|_I \subseteq T$.

Furthermore, if $\mathcal{H}$ satisfies the following condition we say that $\mathcal{H}$ is a strict winning strategy
for the $(j, k)$-pebble-relation game.

(d') $\mathcal{H}$ has the strict $(j, k)$-forth property: for every relation $T$ in $\mathcal{H}$ with domain $I$ with
$|I| \le j$ and every superset $I'$ of $I$ with $|I'| \le k$, the relation with domain $I'$ given
by $\{h \in \hom(\mathbf{A}|_{I'}, \mathbf{B}) : h|_I \in T\}$ belongs to $\mathcal{H}$.
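The two operations on relations used in the game can be sketched in Python for finite structures. In this illustration a function is represented as a frozenset of (argument, value) pairs and a relation $T$ as a set of such functions; these representations and the names `restrict`, `is_partial_hom` and `strict_extension` are assumptions of the sketch, not constructions from the paper. `restrict` computes the Duplicator's forced answer in a shrinking round, and `strict_extension` computes the largest possible answer in a blowing round, i.e., the relation appearing in condition (d').

```python
from itertools import product


def restrict(T, sub):
    """Project every function of T onto the subdomain `sub`."""
    return {frozenset((a, b) for (a, b) in f if a in sub) for f in T}


def is_partial_hom(g, rel_a, rel_b):
    """Check that the dict g is a homomorphism from A restricted to dom(g) to B."""
    for R, tuples in rel_a.items():
        for t in tuples:
            if all(a in g for a in t) and tuple(g[a] for a in t) not in rel_b.get(R, set()):
                return False
    return True


def strict_extension(T, new_dom, rel_a, universe_b, rel_b):
    """All homomorphisms on `new_dom` whose restriction to dom(T) lies in T.

    `new_dom` is assumed to be a superset of the domain of T.
    """
    result = set()
    for f in T:
        base = dict(f)
        extra = [a for a in new_dom if a not in base]
        for values in product(list(universe_b), repeat=len(extra)):
            g = dict(base)
            g.update(zip(extra, values))
            if is_partial_hom(g, rel_a, rel_b):
                result.add(frozenset(g.items()))
    return result
```

The initial configuration $\{\lambda\}$ is `{frozenset()}`. If `strict_extension` returns the empty set, then no answer of the Duplicator to that blowing round can be nonempty, so the Spoiler wins. For example, with A a triangle and B a single symmetric edge, blowing from the empty domain to all three vertices already yields the empty relation.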

The intuition behind the definition of a winning strategy is that every relation in a 
winning strategy corresponds to a winning configuration for the Duplicator in the game. 
We need an auxiliary definition that is used a number of times in the proofs. 

Definition 3. Let $\mathbf{A}$ be a $\tau$-structure, let $a_1, \dots, a_k$ be (not necessarily different) elements
of $A$, and let $v_1, \dots, v_k$ be variables. We denote by $\Theta(\mathbf{A}, a_1, \dots, a_k)(v_1, \dots, v_k)$ the formula
in $O^{k,k}$, with equality, with variables among $v_1, \dots, v_k$ defined by

$$\bigwedge_{R \in \tau}\ \bigwedge_{(a_{i_1}, \dots, a_{i_{\rho(R)}}) \in R^{\mathbf{A}}} R(v_{i_1}, \dots, v_{i_{\rho(R)}}) \ \wedge \bigwedge_{1 \le i < j \le k,\ a_i = a_j} (v_i = v_j)$$

Notice that if all the elements $a_1, \dots, a_k$ are different then we do not need the equality.
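As a small worked instance of Definition 3 (an illustration under assumed data, not an example from the original text), let the vocabulary contain a single binary symbol $E$, let $E^{\mathbf{A}} = \{(a, b)\}$ with $a \ne b$, and take $k = 3$ with $(a_1, a_2, a_3) = (a, b, a)$. The index pairs $(i_1, i_2)$ with $(a_{i_1}, a_{i_2}) \in E^{\mathbf{A}}$ are $(1, 2)$ and $(3, 2)$, and the only repetition among the chosen elements is $a_1 = a_3$, which gives:

```latex
\Theta(\mathbf{A}, a, b, a)(v_1, v_2, v_3)
  \;=\; E(v_1, v_2) \,\wedge\, E(v_3, v_2) \,\wedge\, (v_1 = v_3)
```

With $k = 2$ and the distinct elements $(a_1, a_2) = (a, b)$ the formula reduces to $E(v_1, v_2)$, and no equality is needed.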
The following properties of $\Theta(\mathbf{A}, a_1, \dots, a_k)(v_1, \dots, v_k)$ will be very helpful.

Proposition 2. Let $\mathbf{A}$ be a $\tau$-structure, let $a_1, \dots, a_k$ be elements of $A$ and let $\theta$ be the
formula $\Theta(\mathbf{A}, a_1, \dots, a_k)(v_1, \dots, v_k)$. Then we have

• For every $\tau$-structure $\mathbf{B}$ and every $b_1, \dots, b_k \in B$, we have that

$\mathbf{B}, b_1, \dots, b_k \models \theta(v_1, \dots, v_k)$ iff $\mathbf{A}|_{\{a_1, \dots, a_k\}}, a_1, \dots, a_k \to \mathbf{B}, b_1, \dots, b_k$

• For every quantifier-free formula $\varphi(v_1, \dots, v_k)$ in $L^k$ which satisfies
$\mathbf{A}, a_1, \dots, a_k \models \varphi(v_1, \dots, v_k)$ we have that $\theta(v_1, \dots, v_k)$ implies $\varphi(v_1, \dots, v_k)$, i.e., for
every $\mathbf{B}$ and every $b_1, \dots, b_k \in B$,

$\mathbf{B}, b_1, \dots, b_k \models (\theta \Rightarrow \varphi)(v_1, \dots, v_k)$

The proof of these two simple facts about $\Theta(\mathbf{A}, a_1, \dots, a_k)(v_1, \dots, v_k)$ follows the lines
of the proof of a well-known result, due to Chandra and Merlin [?], which states that
conjunctive query evaluation, conjunctive query containment, and deciding the existence of a
homomorphism are essentially the same problem.

The following results show that pebble-relation games, expressiveness in the existential
positive fragment of finite-variable infinitary logic with restricted conjunction, and
obstruction sets of bounded pathwidth are equivalent mathematical embodiments of the same
concept.

Theorem 1. Let $0 \le j \le k$ be non-negative integers and let $\mathbf{A}$ and $\mathbf{B}$ be (not necessarily
finite) $\tau$-structures. The following statements are equivalent:

(1) The Duplicator has a winning strategy $\mathcal{H}$ for the $(j, k)$-PR game on $\mathbf{A}$ and $\mathbf{B}$.

(2) For every sentence $\varphi$ in $M^{j,k}$ such that $\mathbf{A} \models \varphi$ we have that $\mathbf{B} \models \varphi$.

(3) For every sentence $\varphi$ in $N^{j,k}$ such that $\mathbf{A} \models \varphi$ we have that $\mathbf{B} \models \varphi$.

(4) For every sentence $\varphi$ in $O^{j,k}$ such that $\mathbf{A} \models \varphi$ we have that $\mathbf{B} \models \varphi$.

(5) Every $\tau$-structure $\mathbf{P}$ with pathwidth at most $(j, k)$ that homomorphically maps to
$\mathbf{A}$ also homomorphically maps to $\mathbf{B}$.

The equivalences hold even if we allow the use of equalities in the formulas.

Proof. (1)$\to$(2) Let $\mathcal{H}$ be a winning strategy for the Duplicator. We shall show, by induction
on the construction of $M^{j,k}$ formulas, that if $\varphi(v_1, \dots, v_m)$ is a formula of $M^{j,k}$ with
equalities where the free variables of $\varphi$ are among $v_1, \dots, v_m$, then the following property
(*) holds:

(*) For any elements $a_1, \dots, a_m$ in $A$ such that $\mathbf{A}, a_1, \dots, a_m \models \varphi(v_1, \dots, v_m)$, there
exists some $T \in \mathcal{H}$ with domain $I \subseteq \{a_1, \dots, a_m\}$ such that for every $h : \{a_1, \dots, a_m\} \to B$
such that $h|_I \in T$ we have

$\mathbf{B}, h(a_1), \dots, h(a_m) \models \varphi(v_1, \dots, v_m)$.

First, it is easy to see that if $\varphi(v_1, \dots, v_m)$ is a quantifier-free formula in $M^{j,k}$ such
that $\mathbf{A}, a_1, \dots, a_m \models \varphi(v_1, \dots, v_m)$ and $h : \{a_1, \dots, a_m\} \to B$ is any homomorphism from
$\mathbf{A}|_{\{a_1, \dots, a_m\}}$ to $\mathbf{B}$ then $\mathbf{B}, h(a_1), \dots, h(a_m) \models \varphi(v_1, \dots, v_m)$.

To see it, define $\theta$ as $\Theta(\mathbf{A}, a_1, \dots, a_m)(v_1, \dots, v_m)$. By Proposition 2 we have
$\mathbf{B}, h(a_1), \dots, h(a_m) \models \theta(v_1, \dots, v_m) \Rightarrow \varphi(v_1, \dots, v_m)$. Consequently, we can infer that
$\mathbf{B}, h(a_1), \dots, h(a_m) \models \varphi(v_1, \dots, v_m)$ from the fact that $\mathbf{B}, h(a_1), \dots, h(a_m) \models \theta(v_1, \dots, v_m)$.

Thus, any relation $T \in \mathcal{H}$ with domain $\{a_1, \dots, a_m\}$ would satisfy condition (*). The
inductive step for infinitary disjunction $\bigvee$ is straightforward using the induction hypothesis.

Assume that the formula $\varphi(v_1, \dots, v_m)$ is of the form $\bigwedge \Psi$ where $\Psi$ is a collection of
formulas in $M^{j,k}$. We have to show that there exists some $T \in \mathcal{H}$ with domain
$I \subseteq \{a_1, \dots, a_m\}$ such that for every $h : \{a_1, \dots, a_m\} \to B$ with $h|_I \in T$ we have
$\mathbf{B}, h(a_1), \dots, h(a_m) \models \varphi(v_1, \dots, v_m)$. Assume that there exists some formula $\gamma$ in $\Psi$ which is
quantified but not a sentence (such a formula, if it exists, must be unique). Since $\gamma$ has at
most $j$ free variables we have that $\gamma = \gamma(v_{i_1}, \dots, v_{i_{j'}})$ where $j' \le j$. Thus,
$\mathbf{A}, a_{i_1}, \dots, a_{i_{j'}} \models \gamma(v_{i_1}, \dots, v_{i_{j'}})$, and by the induction hypothesis there exists some
$T' \in \mathcal{H}$ with domain $I' \subseteq \{a_{i_1}, \dots, a_{i_{j'}}\}$ such that for all $h : \{a_{i_1}, \dots, a_{i_{j'}}\} \to B$
such that $h|_{I'} \in T'$, we have $\mathbf{B}, h(a_{i_1}), \dots, h(a_{i_{j'}}) \models \gamma(v_{i_1}, \dots, v_{i_{j'}})$.
If such a formula does not exist then set $I' = \emptyset$ and $T' = \{\lambda\}$.

By the $(j, k)$-forth property, there exists some relation $T$ with domain $\{a_1, \dots, a_m\}$ such
that $T|_{I'} \subseteq T'$. We shall see that $T$ satisfies the desired condition. Let $h : \{a_1, \dots, a_m\} \to B$
be such that $h \in T$, and let $\psi(v_1, \dots, v_m)$ be any formula in $\Psi$. We have to study three cases:

• If $\psi(v_1, \dots, v_m)$ is quantifier-free then since $h \in \hom(\mathbf{A}|_{\{a_1, \dots, a_m\}}, \mathbf{B})$, we have that
$\mathbf{B}, h(a_1), \dots, h(a_m) \models \psi(v_1, \dots, v_m)$.

• If $\psi(v_1, \dots, v_m)$ is a sentence then by induction hypothesis we have $\mathbf{B} \models \psi$ and thus,
in consequence, $\mathbf{B}, h(a_1), \dots, h(a_m) \models \psi(v_1, \dots, v_m)$.

• Otherwise, we have $\psi(v_1, \dots, v_m) = \gamma(v_{i_1}, \dots, v_{i_{j'}})$. Since $T|_{I'} \subseteq T'$, we have
$\mathbf{B}, h(a_{i_1}), \dots, h(a_{i_{j'}}) \models \gamma(v_{i_1}, \dots, v_{i_{j'}})$.

With respect to the existential quantification we shall distinguish two cases, depending
on whether or not the variable quantified is contained in $v_1, \dots, v_m$.

First, assume that the formula $\varphi(v_1, \dots, v_m)$ is of the form $(\exists v)\psi(v_1, \dots, v_m, v)$ (i.e.,
$v \notin \{v_1, \dots, v_m\}$). Thus, there exists some $a \in A$ such that $\mathbf{A}, a_1, \dots, a_m, a \models \psi(v_1, \dots, v_m, v)$.
Consequently, by the induction hypothesis, there exists some $T' \in \mathcal{H}$ with domain
$I' \subseteq \{a_1, \dots, a_m, a\}$ such that for every $h : \{a_1, \dots, a_m, a\} \to B$ such that $h|_{I'} \in T'$ we have
$\mathbf{B}, h(a_1), \dots, h(a_m), h(a) \models \psi(v_1, \dots, v_m, v)$. We consider two cases:

• If $a \in \{a_1, \dots, a_m\}$ then we set $I = I'$ and $T = T'$. Let $h : \{a_1, \dots, a_m\} \to B$ be such
that $h|_I \in T$. In consequence, we have $\mathbf{B}, h(a_1), \dots, h(a_m), h(a) \models \psi(v_1, \dots, v_m, v)$,
and, in consequence, $\mathbf{B}, h(a_1), \dots, h(a_m) \models \varphi(v_1, \dots, v_m)$.

• Otherwise, we set $I = I' \setminus \{a\}$ and $T = T'|_I$ (notice that if $a \notin I'$ then $I = I'$
and $T = T'$). Let $h : \{a_1, \dots, a_m\} \to B$ be a mapping such that $h|_I \in T$ and
let $h' : \{a_1, \dots, a_m, a\} \to B$ be an extension of $h$ such that $h'|_{I'} \in T'$ (such an extension
always exists since $T = T'|_I$). Thus $\mathbf{B}, h'(a_1), \dots, h'(a_m), h'(a) \models \psi(v_1, \dots, v_m, v)$ and
consequently $\mathbf{B}, h(a_1), \dots, h(a_m) \models \varphi(v_1, \dots, v_m)$.

Secondly, assume that $\varphi(v_1, \dots, v_m)$ is of the form $(\exists v)\psi(v_1, \dots, v_m)$. We can assume
without loss of generality that $\{v_1, \dots, v_{m'}, v\} = \{v_1, \dots, v_m\}$ for $m' = m - 1$.

We have that $\mathbf{A}, a_1, \dots, a_{m'} \models \varphi(v_1, \dots, v_{m'})$. Thus, we are in the previous case and,
in consequence, there exists some relation $T$ with domain $I \subseteq \{a_1, \dots, a_{m'}\}$ such that for
every $h' : \{a_1, \dots, a_{m'}\} \to B$ with $h'|_I \in T$ we have $\mathbf{B}, h'(a_1), \dots, h'(a_{m'}) \models \varphi(v_1, \dots, v_{m'})$.
Finally, it is easy to see that $T$ satisfies property (*): Let $h : \{a_1, \dots, a_m\} \to B$ be such that
$h|_I \in T$ and let $h' = h|_{\{a_1, \dots, a_{m'}\}}$. Since $h'|_I = h|_I \in T$ we have that
$\mathbf{B}, h'(a_1), \dots, h'(a_{m'}) \models \varphi(v_1, \dots, v_{m'})$ and hence $\mathbf{B}, h(a_1), \dots, h(a_m) \models \varphi(v_1, \dots, v_m)$.

(2)$\to$(3) and (3)$\to$(4). Straightforward since $O^{j,k} \subseteq N^{j,k} \subseteq M^{j,k}$.

(4)$\rightarrow$(5) It will be convenient to assume that the path-decomposition has a sort of
canonical form. We will say that a path-decomposition $S_1,\dots,S_n$ of a structure $\mathbf{P}$ is
canonical if: (a) $S_n=\emptyset$ and (b) for every $1\le i\le n-1$ we have that $S_i\subseteq S_{i+1}$ or
$S_i\supseteq S_{i+1}$. It is easy to verify that if a structure $\mathbf{P}$ has pathwidth at most $(j,k)$, $0\le j\le k$,
then it has a canonical path-decomposition of width $(j,k)$.
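The canonical form can be checked mechanically. The following is a minimal Python sketch, not taken from the paper: it assumes a structure is given as a pair (universe, relations), that a path-decomposition is a list of sets ("bags") covering the universe and every tuple, with each element occurring in a contiguous run of bags, and that width $(j,k)$ means every bag has at most $k$ elements and consecutive bags share at most $j$ elements (the reading suggested by the induction step below). The function names are hypothetical.

```python
def is_path_decomposition(structure, bags):
    """Check the assumed path-decomposition conditions for a list of bags."""
    universe, relations = structure
    # Assumption: every element of the universe occurs in some bag.
    if not universe <= set().union(*bags):
        return False
    # Every tuple of every relation must lie inside a single bag.
    for tuples in relations.values():
        for t in tuples:
            if not any(set(t) <= bag for bag in bags):
                return False
    # The bags containing a given element must be consecutive.
    for a in universe:
        positions = [i for i, bag in enumerate(bags) if a in bag]
        if positions[-1] - positions[0] + 1 != len(positions):
            return False
    return True


def has_width(bags, j, k):
    """Assumed meaning of width (j, k): bags of size <= k, overlaps of size <= j."""
    return (all(len(b) <= k for b in bags)
            and all(len(x & y) <= j for x, y in zip(bags, bags[1:])))


def is_canonical(bags):
    """(a) the last bag is empty; (b) consecutive bags are comparable by inclusion."""
    return (bool(bags) and len(bags[-1]) == 0
            and all(x <= y or y <= x for x, y in zip(bags, bags[1:])))


# Directed path a -> b -> c with a canonical decomposition of width (1, 2).
path = ({"a", "b", "c"}, {"E": {("a", "b"), ("b", "c")}})
bags = [{"a", "b"}, {"b"}, {"b", "c"}, set()]
```

In this example the non-canonical decomposition `[{"a", "b"}, {"b", "c"}]` is made canonical by inserting the intermediate bag `{"b"}` and appending the empty bag, without increasing either width parameter.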

Let $\mathbf{P}$ be a structure and let $S_1,\dots,S_n$ be a canonical path-decomposition of width
$(j,k)$ of $\mathbf{P}$ where $S_1=\{p_1,\dots,p_m\}$, $m\le k$. We shall show, by induction on the size $n$ of
the decomposition, that there exists a formula $\varphi(v_1,\dots,v_m)$ in $O^{j,k}$ with variables among
$v_1,\dots,v_k$ and whose free variables are exactly $v_1,\dots,v_m$ such that for every structure $\mathbf{D}$
and every $d_1,\dots,d_m\in D$ we have that

$$\mathbf{P},p_1,\dots,p_m\rightarrow\mathbf{D},d_1,\dots,d_m$$

if and only if

$$\mathbf{D},d_1,\dots,d_m\models\varphi(v_1,\dots,v_m).$$

The result then follows from the following line of reasoning: Let $\mathbf{P}$ be any structure of
pathwidth at most $(j,k)$ and let $S_1,\dots,S_m$ be the canonical path-decomposition that certifies
its pathwidth. Then $\emptyset,S_1,\dots,S_m$ is also a path-decomposition of $\mathbf{P}$ and, consequently,
there exists a sentence $\varphi$ in $O^{j,k}$ such that for every structure $\mathbf{D}$, $\mathbf{P}\rightarrow\mathbf{D}$ iff $\mathbf{D}\models\varphi$. Thus,
if $\mathbf{P}\rightarrow\mathbf{A}$ then $\mathbf{A}\models\varphi$ and, by condition (4), $\mathbf{B}\models\varphi$, and $\mathbf{P}\rightarrow\mathbf{B}$.

The base case in the induction ($n=0$) is easily proved by setting $\varphi$ to the formula
identically true, which can be obtained as the conjunction of an empty set of formulas.

For the induction step, let $S_1,\dots,S_n,S_{n+1}$ be a path-decomposition of $\mathbf{P}$ of width $(j,k)$.
Thus $S_2,\dots,S_n,S_{n+1}$ is a path-decomposition of $\mathbf{P}_{|S_2\cup\dots\cup S_{n+1}}$. We should distinguish two
cases, depending on whether $S_1\subseteq S_2$ or $S_2\subseteq S_1$.

First assume that $S_1\subseteq S_2$. Assume, without loss of generality, that $S_1=\{p_1,\dots,p_m\}$
and $S_2=\{p_1,\dots,p_l\}$ where $0\le m\le l\le k$ and $m\le j$. Notice that since $S_1\subseteq S_2$,
$S_2,\dots,S_{n+1}$ is a path-decomposition of $\mathbf{P}$. Thus, by the inductive hypothesis, there exists
some formula $\varphi(v_1,\dots,v_l)$ in $O^{j,k}$, such that for every $\tau$-structure $\mathbf{D}$ and every $d_1,\dots,d_l\in D$

$$\mathbf{P},p_1,\dots,p_l\rightarrow\mathbf{D},d_1,\dots,d_l$$

if and only if

$$\mathbf{D},d_1,\dots,d_l\models\varphi(v_1,\dots,v_l).$$

Consider the formula $\psi(v_1,\dots,v_m)$ in $O^{j,k}$ with variables among $v_1,\dots,v_k$ and whose
free variables are exactly $v_1,\dots,v_m$ defined by:

$$\psi(v_1,\dots,v_m)=(\exists v_{m+1},\dots,v_l)\,\varphi(v_1,\dots,v_l)$$

We shall show that $\psi$ satisfies the desired property. For every $\mathbf{D}$ and every $d_1,\dots,d_m\in D$
we have that

$$\mathbf{P},p_1,\dots,p_m\rightarrow\mathbf{D},d_1,\dots,d_m$$

if and only if

$$\exists d_{m+1},\dots,d_l\in D \text{ s.t. } \mathbf{P},p_1,\dots,p_l\rightarrow\mathbf{D},d_1,\dots,d_l.$$

By the induction hypothesis, this is equivalent to

$$\exists d_{m+1},\dots,d_l\in D \text{ s.t. } \mathbf{D},d_1,\dots,d_l\models\varphi(v_1,\dots,v_l),$$

which is equivalent to

$$\mathbf{D},d_1,\dots,d_m\models(\exists v_{m+1},\dots,v_l)\,\varphi(v_1,\dots,v_l)=\psi(v_1,\dots,v_m).$$

Assume now that $S_2\subseteq S_1$. Let $S_1=\{p_1,\dots,p_m\}$ and $S_2=\{p_1,\dots,p_l\}$, where
$0\le l\le m\le k$ and $l\le j$. By the inductive hypothesis there exists some formula $\varphi(v_1,\dots,v_l)$
in $O^{j,k}$, with variables among $v_1,\dots,v_k$ and whose free variables are exactly $v_1,\dots,v_l$, such
that for every $\tau$-structure $\mathbf{D}$ and every $d_1,\dots,d_l\in D$

$$\mathbf{P}_{|S_2\cup\dots\cup S_{n+1}},p_1,\dots,p_l\rightarrow\mathbf{D},d_1,\dots,d_l$$

if and only if

$$\mathbf{D},d_1,\dots,d_l\models\varphi(v_1,\dots,v_l).$$

Let $\theta(v_1,\dots,v_m)\in O^{j,k}$ be defined by $\theta=\Theta(\mathbf{P},p_1,\dots,p_m)(v_1,\dots,v_m)$. Notice that
since $p_1,\dots,p_m$ are different, $\theta$ does not contain equality. Finally consider the formula
$\psi(v_1,\dots,v_m)$ in $O^{j,k}$ defined by:

$$\psi(v_1,\dots,v_m)=\theta(v_1,\dots,v_m)\wedge\varphi(v_1,\dots,v_l)$$

We shall prove that $\psi$ satisfies the desired property. Let $\mathbf{D}$ be a $\tau$-structure and let
$d_1,\dots,d_m$ be elements in $D$. Let $h$ be any homomorphism from $\mathbf{P}$ to $\mathbf{D}$ such that $h(p_i)=d_i$
for $1\le i\le m$.

Thus $h_{|S_1}$ is a homomorphism from $\mathbf{P}_{|S_1}$ to $\mathbf{D}$ and, by Proposition [?], $\mathbf{D},d_1,\dots,d_m\models\theta(v_1,\dots,v_m)$.
Also, we have that $h_{|S_2\cup\dots\cup S_{n+1}}$ is a homomorphism from $\mathbf{P}_{|S_2\cup\dots\cup S_{n+1}}$ to $\mathbf{D}$
and, by the induction hypothesis, $\mathbf{D},d_1,\dots,d_l\models\varphi(v_1,\dots,v_l)$. Putting all together we have
that

$$\mathbf{D},d_1,\dots,d_m\models\theta(v_1,\dots,v_m)\wedge\varphi(v_1,\dots,v_l)=\psi(v_1,\dots,v_m).$$
Conversely, suppose that $\mathbf{D},d_1,\dots,d_m\models\psi(v_1,\dots,v_m)$. In consequence we have that
$\mathbf{D},d_1,\dots,d_l\models\varphi(v_1,\dots,v_l)$ and, by the induction hypothesis, there exists some homomorphism
$h'$ from $\mathbf{P}_{|S_2\cup\dots\cup S_{n+1}}$ to $\mathbf{D}$ such that $h'(p_i)=d_i$, $1\le i\le l$. Let $h:P\rightarrow D$ be the mapping
defined by

$$h(p)=\begin{cases} d_i & \text{if } p=p_i\\ h'(p) & \text{otherwise}\end{cases}$$

We shall show that $h$ defines a homomorphism from $\mathbf{P}$ to $\mathbf{D}$. First notice that since
$\mathbf{D},d_1,\dots,d_m\models\theta(v_1,\dots,v_m)$, $h_{|S_1}$ defines a homomorphism from $\mathbf{P}_{|S_1}$ to $\mathbf{D}$. Finally,
notice that $h_{|S_2\cup\dots\cup S_{n+1}}=h'$ defines a homomorphism from $\mathbf{P}_{|S_2\cup\dots\cup S_{n+1}}$ to $\mathbf{D}$. This
suffices: Let $R$ be any relation symbol in $\tau$ of arity, say $r$, and let $(a_1,\dots,a_r)$ be any tuple
in $R^{\mathbf{P}}$. Since $S_1,\dots,S_{n+1}$ is a path-decomposition of $\mathbf{P}$ we have that $\{a_1,\dots,a_r\}\subseteq S_j$ for
some $j\in\{1,\dots,n+1\}$. If $j=1$ then we have that $(h(a_1),\dots,h(a_r))$ is in $R^{\mathbf{D}}$, as $h_{|S_1}$ defines
a homomorphism from $\mathbf{P}_{|S_1}$ to $\mathbf{D}$. If, otherwise, $j>1$, then $(h(a_1),\dots,h(a_r))$ is in $R^{\mathbf{D}}$, as
$h_{|S_2\cup\dots\cup S_{n+1}}$ defines a homomorphism from $\mathbf{P}_{|S_2\cup\dots\cup S_{n+1}}$ to $\mathbf{D}$.

(5)$\rightarrow$(1) We shall produce a winning strategy $\mathcal{H}$ for the Duplicator. For every structure
$\mathbf{P}$, for every path-decomposition $S_1,\dots,S_n$ of width $(j,k)$ of $\mathbf{P}$, and for every mapping
$h:P\rightarrow A$ such that:

(i) $h$ is a homomorphism from $\mathbf{P}$ to $\mathbf{A}$ and

(ii) $h_{|S_1}$ is one-to-one and furthermore $(h_{|S_1})^{-1}$ is a homomorphism from $\mathbf{A}_{|h(S_1)}$ to $\mathbf{P}_{|S_1}$,

the set $\mathcal{H}$ contains the relation $T$ with domain $h(S_1)$ defined by

$$T=\{g_{|S_1}\circ(h_{|S_1})^{-1} : \mathbf{P}\xrightarrow{g}\mathbf{B}\}$$

First, notice that if $\mathbf{P}$ is the structure with universe $P=\emptyset$, and $h$ and $g$ are $\lambda$, then
we have that $\{\lambda\}\in\mathcal{H}$, and hence, $\mathcal{H}$ is non-empty. We show now that $\mathcal{H}$ has the required
properties:

• Clearly, every relation $T$ in $\mathcal{H}$ has domain $I\subseteq A$ with $|I|\le k$ and range $B$.

• We have to show that for every $T$ with domain $I$ and every $f\in T$, $f$ is a homomorphism
from $\mathbf{A}_{|I}$ to $\mathbf{B}$. Assume that $f=g_{|S_1}\circ(h_{|S_1})^{-1}$ where $g$, $h$ and $S_1$
are as defined above. Since $(h_{|S_1})^{-1}$ is a homomorphism from $\mathbf{A}_{|h(S_1)}$ to $\mathbf{P}_{|S_1}$ and $g_{|S_1}$
is a homomorphism from $\mathbf{P}_{|S_1}$ to $\mathbf{B}$, their composition $f$ must be a homomorphism
from $\mathbf{A}_{|h(S_1)}$ to $\mathbf{B}$. Furthermore, $T$ is non-empty, since $\mathbf{P}\rightarrow\mathbf{A}$ implies that
$\mathbf{P}\rightarrow\mathbf{B}$.

• We have to show that $\mathcal{H}$ is closed under projection. Let $T$ be any relation in $\mathcal{H}$,
obtained from $\mathbf{P}$, $S_1,\dots,S_n$ and $h$ as defined above. In consequence $T$ has domain
$h(S_1)$. Let $I\subseteq h(S_1)$ and let $S'_1=h^{-1}(I)$. It is not difficult to see that $S'_1,S_1,\dots,S_n$
defines a path-decomposition of $\mathbf{P}$, that $h_{|S'_1}$ is one-to-one, and that $(h_{|S'_1})^{-1}$ defines a
homomorphism from $\mathbf{A}_{|I}$ to $\mathbf{P}_{|S'_1}$. Thus the relation $T'=\{g_{|S'_1}\circ(h_{|S'_1})^{-1} : \mathbf{P}\xrightarrow{g}\mathbf{B}\}$
with domain $I$ belongs also to $\mathcal{H}$. We shall show that $T_{|I}=T'$. For every
homomorphism $f'$ from $\mathbf{A}_{|I}$ to $\mathbf{B}$, $f'\in T'$ iff there exists some homomorphism $g$
from $\mathbf{P}$ to $\mathbf{B}$ such that $f'=g_{|S'_1}\circ(h_{|S'_1})^{-1}$. In consequence, $f'\in T'$ iff $f=g_{|S_1}\circ(h_{|S_1})^{-1}\in T$.
Finally, the result follows from the fact that $f_{|I}=f'$.

• We have to show that $\mathcal{H}$ has the $(j,k)$-forth property. Let $T$ be any relation in $\mathcal{H}$
with domain $I$ obtained from $\mathbf{P}$, $S_1,\dots,S_n$ and $h$ as defined above and let $I'$ be
any superset of $I$ with $|I'|\le k$. Let $I'\setminus I=\{a'_1,\dots,a'_l\}$ and let $p'_1,\dots,p'_l$ be elements
not in $P$. Let $P'=P\cup\{p'_1,\dots,p'_l\}$, let $S'_1=S_1\cup\{p'_1,\dots,p'_l\}$ and let $h':P'\rightarrow A$
be the extension of $h$ that maps $p'_i$ to $a'_i$ ($1\le i\le l$). That is

$$h'(p)=\begin{cases} a'_i & \text{if } p=p'_i,\ 1\le i\le l\\ h(p) & \text{otherwise}\end{cases}$$

We define the structure $\mathbf{P}'$ as the structure with universe $P'$ such that for every $R$
in $\tau$,

$$R^{\mathbf{P}'}=R^{\mathbf{P}}\cup\{(p_1,\dots,p_m) : (h'(p_1),\dots,h'(p_m))\in R^{\mathbf{A}},\ \{p_1,\dots,p_m\}\subseteq S'_1\}$$

Notice that by construction $S'_1,S_1,\dots,S_n$ is a path-decomposition of width $(j,k)$
of $\mathbf{P}'$ and that $h'$ is a homomorphism from $\mathbf{P}'$ to $\mathbf{A}$ such that $h'_{|S'_1}$ is one-to-one
and that $(h'_{|S'_1})^{-1}$ is a homomorphism from $\mathbf{A}_{|I'}$ to $\mathbf{P}'_{|S'_1}$. Let $T'$ be the relation
obtained from $\mathbf{P}'$, $S'_1,S_1,\dots,S_n$ and $h'$. We shall see that $T'_{|I}\subseteq T$: Let $f$ be any
function in $T'$. Thus, $f=g_{|S'_1}\circ(h'_{|S'_1})^{-1}$ for some homomorphism $g$ from $\mathbf{P}'$ to
$\mathbf{B}$. Thus, $g_{|S_1\cup\dots\cup S_n}$ defines a homomorphism from $\mathbf{P}$ to $\mathbf{B}$ and, in consequence,
$f_{|I}=g_{|S_1}\circ(h_{|S_1})^{-1}$ belongs to $T$.

□ 

The following theorem provides us with several alternative ways to characterize $M^{j,k}$-definability.

Theorem 2. Let $0\le j\le k$ be non-negative integers and let $\mathcal{C}$ be a class of $\tau$-structures.
The following statements are equivalent:

(1) The class $\mathcal{C}$ is $M^{j,k}$-definable, i.e., there is a sentence $\varphi$ of $M^{j,k}$ such that for every
$\tau$-structure $\mathbf{A}$ we have that $\mathbf{A}\in\mathcal{C}$ iff $\mathbf{A}\models\varphi$.

(2) If $\mathbf{A}$ and $\mathbf{B}$ are $\tau$-structures such that $\mathbf{A}\in\mathcal{C}$ and the Duplicator has a winning
strategy for the $(j,k)$-PR game on $\mathbf{A}$ and $\mathbf{B}$, then $\mathbf{B}\in\mathcal{C}$.

Furthermore, if $\neg\mathcal{C}$ is a finitely generated ideal we also have that (1) and (2) are equivalent
to the following:

(3) The class $\mathcal{C}$ is $N^{j,k}$-definable.

(4) The class $\neg\mathcal{C}$ has an obstruction set of pathwidth at most $(j,k)$.

Proof. First, we will need the following definition. 

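Let $\mathcal{S}$ be a finite collection of $\tau$-structures. We say that $\mathcal{S}$ is disjoint if for every different
$\mathbf{A},\mathbf{B}\in\mathcal{S}$, $A\cap B=\emptyset$. If $\mathcal{S}$ is disjoint we define $\oplus\mathcal{S}$ as the $\tau$-structure whose universe is the
union of the universes of all the structures in $\mathcal{S}$, and such that for every relation symbol
$R\in\tau$, $R^{\oplus\mathcal{S}}=\bigcup_{\mathbf{A}\in\mathcal{S}}R^{\mathbf{A}}$. Observe that, as the universes of the structures in $\mathcal{S}$ are disjoint,
if every structure in $\mathcal{S}$ has pathwidth at most $(j,k)$ for some $j,k$ then $\oplus\mathcal{S}$ has also pathwidth at most $(j,k)$.

If $\mathcal{S}=\{\mathbf{A}_1,\dots,\mathbf{A}_m\}$ we also denote $\oplus\mathcal{S}$ by $\mathbf{A}_1\oplus\dots\oplus\mathbf{A}_m$.

Let $\mathbf{B}$ be a $\tau$-structure and let $\mathcal{S}$ be a set of $\tau$-structures. It is easy to verify that
$\oplus\mathcal{S}\rightarrow\mathbf{B}$ iff $\mathbf{A}\rightarrow\mathbf{B}$ for every $\mathbf{A}\in\mathcal{S}$.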
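The last observation is easy to test on small instances. The following Python sketch is an illustration only: it assumes structures are (universe, relations) pairs, forces disjointness by tagging each element with the index of its structure, and decides the existence of a homomorphism by brute force. The names `has_homomorphism` and `disjoint_union` are hypothetical.

```python
from itertools import product


def has_homomorphism(A, B):
    """Brute-force test for the existence of a homomorphism from A to B."""
    (a_univ, a_rels), (b_univ, b_rels) = A, B
    elems = list(a_univ)
    for images in product(list(b_univ), repeat=len(elems)):
        h = dict(zip(elems, images))
        # h is a homomorphism if it maps every tuple of A into the same relation of B.
        if all(tuple(h[x] for x in t) in b_rels.get(name, set())
               for name, tuples in a_rels.items() for t in tuples):
            return True
    return False


def disjoint_union(structures):
    """Build the disjoint union, tagging elements of the i-th structure as (i, x)."""
    universe, relations = set(), {}
    for i, (univ, rels) in enumerate(structures):
        universe |= {(i, x) for x in univ}
        for name, tuples in rels.items():
            relations.setdefault(name, set()).update(
                tuple((i, x) for x in t) for t in tuples)
    return universe, relations


# A single directed edge and a directed 3-cycle.
edge = ({0, 1}, {"E": {(0, 1)}})
cycle3 = ({0, 1, 2}, {"E": {(0, 1), (1, 2), (2, 0)}})
collection = [edge, cycle3]
```

Here the union of `edge` and `cycle3` maps to `cycle3` because both components do, and it does not map to `edge` because `cycle3` does not.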

Now we are in a position to prove Theorem 2.

(1)$\rightarrow$(2) Straightforward from Theorem [?]. Assume that $\mathbf{A}\models\varphi$. Since the Duplicator
has a winning strategy for the $(j,k)$-PR game then $\mathbf{B}\models\varphi$ and thus $\mathbf{B}$ belongs to $\mathcal{C}$.

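(2)$\rightarrow$(1) Clearly $\mathcal{C}$ is a filter and let $\neg\mathcal{C}$ be $I(\mathcal{S})$ for some set $\mathcal{S}$ of $\tau$-structures. For
every $\mathbf{B}$ in $\mathcal{S}$, let $\Phi_{\mathbf{B}}$ be the collection of sentences in $N^{j,k}$ falsified by $\mathbf{B}$ and let $\varphi_{\mathbf{B}}=\bigvee\Phi_{\mathbf{B}}$,
which is a sentence of $N^{j,k}$. Let $\Psi$ be the set containing $\varphi_{\mathbf{B}}$ for every $\mathbf{B}$ in $\mathcal{S}$, and let $\varphi$
be the sentence of $M^{j,k}$ defined by $\varphi=\bigwedge\Psi$. Notice that if $\mathcal{S}$ is finite then $\varphi$ is in $N^{j,k}$.
We shall see that $\varphi$ defines $\mathcal{C}$. Let $\mathbf{A}$ be any structure not in $\mathcal{C}$. Thus, there exists some
$\mathbf{B}$ in $\mathcal{S}$ such that $\mathbf{A}$ homomorphically maps to $\mathbf{B}$. Recall that if $\mathbf{A}$ is homomorphic to $\mathbf{B}$,
by Proposition [?], $\mathbf{B}$ satisfies all sentences in $M^{j,k}$ satisfied by $\mathbf{A}$. In consequence, $\mathbf{A}\not\models\varphi_{\mathbf{B}}$
and thus $\mathbf{A}\not\models\varphi$. Conversely, let $\mathbf{A}$ be any structure such that $\mathbf{A}\not\models\varphi$; thus $\mathbf{A}\not\models\varphi_{\mathbf{B}}$ for
some $\mathbf{B}$ in $\mathcal{S}$. Thus, every formula $\psi$ in $N^{j,k}$ satisfied by $\mathbf{A}$ is also satisfied by $\mathbf{B}$, and, in
consequence, the Duplicator has a winning strategy for the $(j,k)$-pebble-relation game on
$\mathbf{A}$ and $\mathbf{B}$. Since $\mathbf{B}\notin\mathcal{C}$ we conclude that $\mathbf{A}\notin\mathcal{C}$.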

(3)$\rightarrow$(1) Trivial as $N^{j,k}\subseteq M^{j,k}$.

(4)$\rightarrow$(1) Straightforward from Theorem [?]. Let $\mathbf{A}\in\mathcal{C}$ and $\mathbf{B}\notin\mathcal{C}$ be $\tau$-structures.
Since $\mathbf{A}\in\mathcal{C}$ there exists a structure $\mathbf{P}$ in the obstruction set (and therefore of pathwidth at
most $(j,k)$) that homomorphically maps to $\mathbf{A}$. Since $\mathbf{B}\notin\mathcal{C}$, $\mathbf{P}$ does not homomorphically
map to $\mathbf{B}$. In consequence, there does not exist a winning strategy for the Duplicator for
the $(j,k)$-pebble-relation game on $\mathbf{A}$ and $\mathbf{B}$.

(2)$\rightarrow$(4) Let $\mathbf{B}_1,\dots,\mathbf{B}_m$ be the finite set of structures that generates $\neg\mathcal{C}$. Consider
the set $\mathcal{O}$ given by the collection of all structures of the form $\mathbf{P}_1\oplus\dots\oplus\mathbf{P}_m$ where $\mathbf{P}_i$ is a
$\tau$-structure of pathwidth at most $(j,k)$ that does not map to $\mathbf{B}_i$, and $\mathbf{P}_i$, $1\le i\le m$, have
disjoint universes. Thus, every structure in $\mathcal{O}$ has pathwidth at most $(j,k)$. We shall show
that $\mathcal{O}$ is an obstruction set of $\neg\mathcal{C}$. First, observe that every structure $\mathbf{P}_1\oplus\dots\oplus\mathbf{P}_m$ in $\mathcal{O}$
belongs to $\mathcal{C}$, as for each $i\in\{1,\dots,m\}$, $\mathbf{P}_i\not\rightarrow\mathbf{B}_i$. Consequently, if one of such structures
$\mathbf{P}_1\oplus\dots\oplus\mathbf{P}_m$ is homomorphic to a given structure $\mathbf{A}$, then $\mathbf{A}$ must be in $\mathcal{C}$, as $\mathcal{C}$ is a
filter. Conversely, let $\mathbf{A}$ be any structure in $\mathcal{C}$. Thus, for every $1\le i\le m$, the Duplicator
does not have a winning strategy for the $(j,k)$-pebble-relation game on $\mathbf{A}$ and $\mathbf{B}_i$. Thus, for every
$1\le i\le m$ there exists some $\tau$-structure $\mathbf{P}_i$ of pathwidth at most $(j,k)$ such that $\mathbf{P}_i\rightarrow\mathbf{A}$
and $\mathbf{P}_i\not\rightarrow\mathbf{B}_i$ (we can assume without loss of generality that the universes of $\mathbf{P}_i$, $1\le i\le m$,
are disjoint). Thus $\mathbf{P}_1\oplus\dots\oplus\mathbf{P}_m$ belongs to $\mathcal{O}$ and furthermore it homomorphically maps
to $\mathbf{A}$.

(2)$\rightarrow$(3) The structure of this proof is similar to the previous case.

Let $\mathbf{B}_1,\dots,\mathbf{B}_m$ be the finite set of structures that generates $\neg\mathcal{C}$. Consider the set $\Psi$
given by the collection of all sentences of the form $\varphi_1\wedge\dots\wedge\varphi_m$ where $\varphi_i$ is a sentence
in $O^{j,k}$ that is not satisfied by $\mathbf{B}_i$. Then the formula $\varphi$ defined to be $\bigvee\Psi$ is in $N^{j,k}$. Let
us show that $\varphi$ defines $\mathcal{C}$. First observe that for every $\mathbf{B}_i$, $1\le i\le m$, generating $\neg\mathcal{C}$, we
have that $\mathbf{B}_i\not\models\varphi$. Consequently, as $\neg\mathcal{C}$ is an ideal, by Proposition [?], for any structure $\mathbf{A}$
not in $\mathcal{C}$, $\mathbf{A}\not\models\varphi$. Conversely, let $\mathbf{A}$ be any structure in $\mathcal{C}$. Thus, for every $1\le i\le m$, the
Duplicator does not have a winning strategy for the $(j,k)$-pebble-relation game on $\mathbf{A}$ and $\mathbf{B}_i$. Thus,
for every $1\le i\le m$ there exists some sentence $\varphi_i$ in $O^{j,k}$ that is true on $\mathbf{A}$ but not true on
$\mathbf{B}_i$. Thus $\varphi_1\wedge\dots\wedge\varphi_m$ is true on $\mathbf{A}$ and belongs to $\Psi$. Consequently, $\varphi$ is true on $\mathbf{A}$. □

6. Datalog Programs 

Let $\tau$ be a vocabulary consisting of relational symbols. The class SNP [?] is the set
of all existential second-order sentences with a universal first-order part, i.e., sentences of
the form $\exists S_1,\dots,S_l\,\forall v_1,\dots,v_m\,\varphi(v_1,\dots,v_m)$ where $\varphi$ is a quantifier-free first-order formula
over the vocabulary $\tau\cup\{S_1,\dots,S_l\}$ with variables among $v_1,\dots,v_m$. We will assume
that $\varphi$ is in CNF. We consider some restrictions that can be enforced on the class SNP.
For monotone SNP [?], we require every occurrence of a relation symbol from $\tau$ to have
negative polarity, i.e., a negation applied to it. For $j$-adic SNP, we require every second-order
variable $S_i$, $1\le i\le l$, to have arity at most $j$. For $k$-ary SNP, we require the number $m$ of
universally quantified variables to be at most $k$. For Krom SNP we require every clause
of the quantifier-free first-order part $\varphi$ to have at most two occurrences of a second-order
variable.

Every Krom SNP formula $\varphi$ over the vocabulary $\tau$ defines a class of $\tau$-structures,
namely, the set containing every $\tau$-structure $\mathbf{A}$ such that $\mathbf{A}\models\varphi$. Furthermore, the problem
of deciding, given a $\tau$-structure $\mathbf{A}$, whether $\mathbf{A}\models\varphi$ is solvable in NL [14]. Thus expressibility
in Krom SNP is a sufficient condition for membership in NL.

For restricted Krom SNP we additionally require every clause of the quantifier-free first-order
part $\varphi$ to have at most one positive occurrence of a second-order variable and at most
one negative occurrence of a second-order variable.
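The syntactic restrictions above are per-clause conditions and can be checked directly. The sketch below is only an illustration under assumed conventions: a clause is a list of literals, each literal records its sign, predicate name and arguments, and the caller supplies the sets of input predicates (from $\tau$) and second-order predicates. The class `Literal` and the three function names are hypothetical.

```python
from typing import NamedTuple, Tuple


class Literal(NamedTuple):
    positive: bool          # False means the atom is negated
    predicate: str
    args: Tuple[str, ...]


def is_monotone(clause, input_preds):
    """Every occurrence of an input (tau) predicate is negated."""
    return all(not lit.positive for lit in clause if lit.predicate in input_preds)


def is_krom(clause, so_preds):
    """At most two occurrences of second-order predicates."""
    return sum(1 for lit in clause if lit.predicate in so_preds) <= 2


def is_restricted_krom(clause, so_preds):
    """At most one positive and at most one negative second-order occurrence."""
    pos = sum(1 for lit in clause if lit.predicate in so_preds and lit.positive)
    neg = sum(1 for lit in clause if lit.predicate in so_preds and not lit.positive)
    return pos <= 1 and neg <= 1


# Clause  not S(x) or not E(x, y) or S(y): a reachability-style rule.
reach = [Literal(False, "S", ("x",)),
         Literal(False, "E", ("x", "y")),
         Literal(True, "S", ("y",))]
# Clause  not S(x) or not S(y) or not E(x, y): Krom, but two negative occurrences of S.
two_neg = [Literal(False, "S", ("x",)),
           Literal(False, "S", ("y",)),
           Literal(False, "E", ("x", "y"))]
```

The first clause has the shape of a rule with one second-order atom in the body and one in the head, which is the shape the restricted Krom condition permits.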

Theorem 3. Let $0\le j\le k$ be non-negative integers and let $\mathbf{B}$ be a $\tau$-structure. There
exists a sentence $\varphi$ in $j$-adic $k$-ary restricted Krom monotone SNP with equalities such that
for every $\tau$-structure $\mathbf{A}$, $\mathbf{A}\models\varphi$ iff the Duplicator has a winning strategy for the $(j,k)$-PR
game on $\mathbf{A}$ and $\mathbf{B}$.

Proof. The proof of Theorem 3 requires some intermediate results. In a first step we define
a different notion of winning strategy, called "supercomplete winning strategy", and we show
that this new notion is equivalent to the notion of winning strategy introduced initially, that
is, we prove that for all $\tau$-structures $\mathbf{A}$ and $\mathbf{B}$, and for every $0\le j\le k$, the Duplicator has
a winning strategy for the $(j,k)$-pebble-relation game if and only if it has a supercomplete winning
strategy. In order to prove this we define several different but equivalent notions of winning
strategy that will act as links in a chain of inferences.

The new notions of winning strategy are the following: 

Definition 4. Let $0\le j\le k$ be non-negative integers and let $\mathbf{A}$ and $\mathbf{B}$ be $\tau$-structures.
We say that the Duplicator has a complete winning strategy for the $(j,k)$-pebble-relation
game on $\mathbf{A}$ and $\mathbf{B}$ if there is a family $\mathcal{H}$ of relations such that:

(a) every relation $T$ in $\mathcal{H}$ has range $B$ and domain $I$ for some $I\subseteq A$ with $|I|\le j$.

(b) $\mathcal{H}$ contains $\{\lambda\}$ and does not contain $\emptyset$.

(c) for every $I\subseteq A$ with $|I|\le k$, every relation $T$ in $\mathcal{H}$ with domain $I'\subseteq I$ and every
$I''\subseteq I$ with $|I''|\le j$, the relation with domain $I''$ given by

$$\{h_{|I''} : h\in\hom(\mathbf{A}_{|I},\mathbf{B}),\ h_{|I'}\in T\}$$

belongs to $\mathcal{H}$.

Furthermore, if $\mathcal{H}$ satisfies the following condition we say that $\mathcal{H}$ is a supercomplete winning
strategy for the $(j,k)$-pebble-relation game:

(d) For every $T$ in $\mathcal{H}$ and every $T\subseteq T'$ we have that $T'\in\mathcal{H}$.
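Condition (c) is built from one operation on relations: extend the mappings of $T$ to homomorphisms on $I$, then project to $I''$. The following Python sketch illustrates that operation on a tiny example. It is not the paper's construction of a strategy; it assumes structures are (universe, relations) pairs, represents a mapping as a frozenset of (element, value) pairs (so the empty mapping $\lambda$ is `frozenset()`), and uses the hypothetical names `homs`, `restrict` and `step_c`.

```python
from itertools import product


def homs(A, B, I):
    """All homomorphisms from the substructure of A induced by I to B."""
    (_, a_rels), (b_univ, b_rels) = A, B
    elems, dom = list(I), set(I)
    result = set()
    for images in product(list(b_univ), repeat=len(elems)):
        h = dict(zip(elems, images))
        # Only tuples lying entirely inside I belong to the induced substructure.
        if all(tuple(h[x] for x in t) in b_rels.get(name, set())
               for name, tuples in a_rels.items()
               for t in tuples if set(t) <= dom):
            result.add(frozenset(h.items()))
    return result


def restrict(h, dom):
    """Restriction of a mapping (frozenset of pairs) to the set dom."""
    return frozenset((a, b) for a, b in h if a in dom)


def step_c(A, B, T, T_dom, I, I2):
    """The relation {h|I2 : h in hom(A|I, B), h|T_dom in T} from condition (c)."""
    return {restrict(h, I2) for h in homs(A, B, I) if restrict(h, T_dom) in T}


# A is the directed path 0 -> 1 -> 2, B is a single directed edge 0 -> 1.
A = ({0, 1, 2}, {"E": {(0, 1), (1, 2)}})
B = ({0, 1}, {"E": {(0, 1)}})
LAMBDA = frozenset()

T1 = step_c(A, B, {LAMBDA}, set(), {0, 1}, {1})   # start from {lambda}, I = {0, 1}
T2 = step_c(A, B, T1, {1}, {1, 2}, {2})           # continue with I = {1, 2}
```

In this example, taking $j=1$ and $k=2$, two applications of the operation starting from $\{\lambda\}$ produce the empty relation, so no family can satisfy (b) and (c) simultaneously. This agrees with the fact that the path itself maps to $\mathbf{A}$ but not to $\mathbf{B}$.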

The following claim states that all the different characterizations of winning strategy 
introduced so far are equivalent. 

Claim 1. Let $0\le j\le k$ be non-negative integers and let $\mathbf{A}$ and $\mathbf{B}$ be $\tau$-structures. The
following statements are equivalent:

(1) The Duplicator has a winning strategy for the $(j,k)$-PR game on $\mathbf{A}$ and $\mathbf{B}$.

(2) The Duplicator has a strict winning strategy for the $(j,k)$-PR game on $\mathbf{A}$ and $\mathbf{B}$.

(3) The Duplicator has a complete winning strategy for the $(j,k)$-PR game on $\mathbf{A}$ and
$\mathbf{B}$.

(4) The Duplicator has a supercomplete winning strategy for the $(j,k)$-PR game on $\mathbf{A}$
and $\mathbf{B}$.

Proof. (1)$\rightarrow$(2) Let $\mathcal{H}$ be a winning strategy for the Duplicator for the $(j,k)$-PR game on
$\mathbf{A}$ and $\mathbf{B}$. We define $\mathcal{H}^*$ to be the set

$$\{T^* : \mathrm{dom}(T^*)\subseteq A,\ |\mathrm{dom}(T^*)|\le k,\ T^*\subseteq\hom(\mathbf{A}_{|\mathrm{dom}(T^*)},\mathbf{B}),\ \exists T\in\mathcal{H} \text{ s.t. } T\subseteq T^*\}$$

We will show that $\mathcal{H}^*$ is a strict winning strategy:

• It is obvious that $\mathcal{H}^*$ satisfies conditions (a) and (b).

• $\mathcal{H}^*$ is closed under restrictions: Let $T^*$ be a relation in $\mathcal{H}^*$ with domain $I$ and let $I'\subseteq I$.
Thus, there exists some $T$ in $\mathcal{H}$ with domain $I$ such that $T\subseteq T^*$. Since $\mathcal{H}$ is closed
under restrictions, $T_{|I'}\in\mathcal{H}$. Since $T_{|I'}\subseteq T^*_{|I'}$, we have that $T^*_{|I'}\in\mathcal{H}^*$.

• $\mathcal{H}^*$ has the strict $(j,k)$-forth property: Let $T^*$ be a relation in $\mathcal{H}^*$ with domain $I$,
with $|I|\le j$, and let $I'$ be a superset of $I$ with $|I'|\le k$. Thus, there exists some
relation $T$ in $\mathcal{H}$ with domain $I$ such that $T\subseteq T^*$. Since $\mathcal{H}$ has the $(j,k)$-forth
property there exists some relation $T'$ in $\mathcal{H}$ with domain $I'$ such that $T'_{|I}\subseteq T$. In
consequence $T'\subseteq\{h\in\hom(\mathbf{A}_{|I'},\mathbf{B}) : h_{|I}\in T^*\}$. Thus, the latter is in $\mathcal{H}^*$.

(2)$\rightarrow$(3) Let $\mathcal{H}$ be a strict winning strategy for the Duplicator and let $\mathcal{H}^*$ be the
collection of relations in $\mathcal{H}$ with domain of size at most $j$. We shall show that $\mathcal{H}^*$ is a
complete winning strategy. It is straightforward to show that $\mathcal{H}^*$ satisfies conditions (a)
and (b) of the definition of a complete winning strategy. For condition (c), let $I\subseteq A$ with
$|I|\le k$, let $T^*$ be a relation in $\mathcal{H}^*$ (and hence in $\mathcal{H}$) with domain $I'\subseteq I$ and let $I''\subseteq I$ with
$|I''|\le j$. By the strict $(j,k)$-forth property we have that the relation with domain $I$ given by
$\{h\in\hom(\mathbf{A}_{|I},\mathbf{B}) : h_{|I'}\in T^*\}$ belongs also to $\mathcal{H}$, and so does its projection to $I''$, which is given
by $\{h_{|I''} : h\in\hom(\mathbf{A}_{|I},\mathbf{B}),\ h_{|I'}\in T^*\}$. Since $|I''|\le j$, it also belongs to $\mathcal{H}^*$.

(3)$\rightarrow$(4) Let $\mathcal{H}$ be a complete winning strategy and let $\mathcal{H}^*$ be the set containing
every relation $T^*$ such that there exists a relation $T\in\mathcal{H}$ with $T\subseteq T^*$. Clearly $\mathcal{H}^*$
satisfies conditions (a), (b), and (d) of a supercomplete winning strategy. For condition
(c), let $I\subseteq A$ with $|I|\le k$, let $T^*$ be a relation in $\mathcal{H}^*$ with domain $I'\subseteq I$ and let
$I''\subseteq I$ with $|I''|\le j$. There exists some $T\in\mathcal{H}$ such that $T\subseteq T^*$ and, in consequence,
$\{h_{|I''} : h\in\hom(\mathbf{A}_{|I},\mathbf{B}),\ h_{|I'}\in T\}$ is a subset of $\{h_{|I''} : h\in\hom(\mathbf{A}_{|I},\mathbf{B}),\ h_{|I'}\in T^*\}$ and
hence the latter is in $\mathcal{H}^*$.

(4)$\rightarrow$(1) Let $\mathcal{H}$ be a supercomplete winning strategy and let $\mathcal{H}^*$ be a set containing, for
every $I\subseteq A$ with $|I|\le k$, for every $T$ in $\mathcal{H}$ with domain $I'\subseteq I$, and for every $I''\subseteq I$, the
relation $\{h_{|I''} : h\in\hom(\mathbf{A}_{|I},\mathbf{B}),\ h_{|I'}\in T\}$. It is straightforward to show that $\mathcal{H}^*$ satisfies
the conditions (a), (b), and (c) of a winning strategy. For condition (d), let $T''$ be any
relation in $\mathcal{H}^*$ with domain $I''$ with $|I''|\le j$. Thus, $T''=\{h_{|I''} : h\in\hom(\mathbf{A}_{|I},\mathbf{B}),\ h_{|I'}\in T\}$
for some $T$ in $\mathcal{H}$ with domain $I'$ and some $I'\subseteq I$ with $|I|\le k$. Then, $T''$ is also in $\mathcal{H}$. Now, let
$I'''$ be any superset of $I''$ with $|I'''|\le k$. Thus, $T'''=\{h_{|I'''} : h\in\hom(\mathbf{A}_{|I'''},\mathbf{B}),\ h_{|I''}\in T''\}$
is also in $\mathcal{H}^*$ and $T'''_{|I''}\subseteq T''$.

□ 

In a second step we reformulate having a supercomplete winning strategy as the existence
of a structure satisfying certain properties. Let $r$ be a non-negative integer and let $B$ be
a set. Recall that an $r$-ary relation over $B$ is a relation with domain $\{1,\dots,r\}$ and range
$B$.

Let $\{U_n : 1\le n\le m\}$ be the collection of all $j$-ary relations over $B$ and let $\{R_{U_n} : 1\le n\le m\}$
be a collection of $j$-ary relation symbols, one for each relation $U_n$, $1\le n\le m$.
We shall show that the Duplicator has a supercomplete winning strategy for the $(j,k)$-pebble-relation
game on $\mathbf{A}$ and $\mathbf{B}$ iff there exists some structure $\mathbf{A}'$ with domain $A$ over the
vocabulary $\{R_{U_n} : 1\le n\le m\}$ such that:

(a) $(R_{\emptyset})^{\mathbf{A}'}=\emptyset$.

(b) For every (not necessarily different) $a_1,\dots,a_k\in A$, and for every $1\le l_1,\dots,l_j\le k$,
$(a_{l_1},\dots,a_{l_j})\in(R_U)^{\mathbf{A}'}$ where

$$U=\{(h(a_{l_1}),\dots,h(a_{l_j})) : h\in\hom(\mathbf{A}_{|\{a_1,\dots,a_k\}},\mathbf{B})\}$$

(c) For every (not necessarily different) $a_1,\dots,a_k$ elements in $A$, for every relation
symbol $R_{U_n}$, $1\le n\le m$, for every $(a_{i_1},\dots,a_{i_j})\in(R_{U_n})^{\mathbf{A}'}$ and for every
$1\le l_1,\dots,l_j\le k$, we have that $(a_{l_1},\dots,a_{l_j})\in(R_U)^{\mathbf{A}'}$ where

$$U=\{(h(a_{l_1}),\dots,h(a_{l_j})) : h\in\hom(\mathbf{A}_{|\{a_1,\dots,a_k\}},\mathbf{B}),\ (h(a_{i_1}),\dots,h(a_{i_j}))\in U_n\}$$

(d) For every $1\le n,n'\le m$ such that $U_n\subseteq U_{n'}$ we have $(R_{U_n})^{\mathbf{A}'}\subseteq(R_{U_{n'}})^{\mathbf{A}'}$.

We shall say that $\mathbf{A}'$ is a relational winning strategy for the Duplicator for the $(j,k)$-pebble-relation
game on $\mathbf{A}$ and $\mathbf{B}$.

Claim 2. Let $0\le j\le k$ be non-negative integers and let $\mathbf{A}$ and $\mathbf{B}$ be $\tau$-structures. The
following statements are equivalent:

• The Duplicator has a supercomplete winning strategy for the $(j,k)$-pebble-relation
game on $\mathbf{A}$ and $\mathbf{B}$.

• The Duplicator has a relational winning strategy for the $(j,k)$-pebble-relation game
on $\mathbf{A}$ and $\mathbf{B}$.

Proof. First, we need the following definition. Let $a_1,\dots,a_l$ be a collection of (not necessarily
different) elements of $A$ and let $T$ be a relation with domain $\{a_1,\dots,a_l\}$. We define
$U(T,a_1,\dots,a_l)$ to be the $l$-ary relation over $B$ given by

$$\{(h(a_1),\dots,h(a_l)) : h\in T\}$$

Let $\mathcal{H}$ be a supercomplete winning strategy for the Duplicator for the $(j,k)$-pebble-relation game on
$\mathbf{A}$ and $\mathbf{B}$ and let $\mathbf{A}'$ be the $\{R_{U_n} : 1\le n\le m\}$-structure such that for every $1\le n\le m$,

$$(R_{U_n})^{\mathbf{A}'}=\{(a_1,\dots,a_j) : a_1,\dots,a_j\in A,\ \exists T\in\mathcal{H},\ \{a_1,\dots,a_j\}=\mathrm{dom}(T),\ U(T,a_1,\dots,a_j)\subseteq U_n\}$$

We shall show that $\mathbf{A}'$ satisfies the desired conditions:

(a) By definition every relation $T$ in $\mathcal{H}$ is non-empty. Thus $(R_{\emptyset})^{\mathbf{A}'}=\emptyset$.

(b) Let $a_1,\dots,a_k$ be (not necessarily different) elements in $A$ and let $1\le l_1,\dots,l_j\le k$.
We shall show that $(a_{l_1},\dots,a_{l_j})\in(R_U)^{\mathbf{A}'}$ where

$$U=\{(h(a_{l_1}),\dots,h(a_{l_j})) : h\in\hom(\mathbf{A}_{|\{a_1,\dots,a_k\}},\mathbf{B})\}$$

Since $\mathcal{H}$ is a supercomplete winning strategy, the relation

$$T=\{h_{|\{a_{l_1},\dots,a_{l_j}\}} : h\in\hom(\mathbf{A}_{|\{a_1,\dots,a_k\}},\mathbf{B}),\ h_{|\emptyset}\in\{\lambda\}\}$$

which is equivalent to

$$\{h_{|\{a_{l_1},\dots,a_{l_j}\}} : h\in\hom(\mathbf{A}_{|\{a_1,\dots,a_k\}},\mathbf{B})\}$$

belongs to $\mathcal{H}$. In consequence we have $(a_{l_1},\dots,a_{l_j})\in(R_{U(T,a_{l_1},\dots,a_{l_j})})^{\mathbf{A}'}$. Finally,
we have $U(T,a_{l_1},\dots,a_{l_j})=U$.

(c) Let $a_1,\dots,a_k$ be (not necessarily different) elements in $A$, let $R_{U_n}$ be a relation
symbol, let $(a_{i_1},\dots,a_{i_j})\in(R_{U_n})^{\mathbf{A}'}$ and let $1\le l_1,\dots,l_j\le k$. We shall show that
$(a_{l_1},\dots,a_{l_j})\in(R_U)^{\mathbf{A}'}$ where

$$U=\{(h(a_{l_1}),\dots,h(a_{l_j})) : h\in\hom(\mathbf{A}_{|\{a_1,\dots,a_k\}},\mathbf{B}),\ (h(a_{i_1}),\dots,h(a_{i_j}))\in U_n\}$$

First, there exists some $T\in\mathcal{H}$ that has domain $\{a_{i_1},\dots,a_{i_j}\}$ and such that the
inclusion $U(T,a_{i_1},\dots,a_{i_j})\subseteq U_n$ holds. Since $\mathcal{H}$ is a supercomplete winning strategy
we have that

$$T'=\{h_{|\{a_{l_1},\dots,a_{l_j}\}} : h\in\hom(\mathbf{A}_{|\{a_1,\dots,a_k\}},\mathbf{B}),\ h_{|\{a_{i_1},\dots,a_{i_j}\}}\in T\}$$
$$\subseteq\{h_{|\{a_{l_1},\dots,a_{l_j}\}} : h\in\hom(\mathbf{A}_{|\{a_1,\dots,a_k\}},\mathbf{B}),\ (h(a_{i_1}),\dots,h(a_{i_j}))\in U_n\}$$

belongs also to $\mathcal{H}$.

Thus, $(a_{l_1},\dots,a_{l_j})$ belongs to $(R_{U(T',a_{l_1},\dots,a_{l_j})})^{\mathbf{A}'}$ and it is not difficult to see that
$U(T',a_{l_1},\dots,a_{l_j})\subseteq U$.

(d) Straightforward from the definition.

For the converse, let $\mathbf{A}'$ be a $\{R_{U_n} : 1\le n\le m\}$-structure satisfying (a), (b), (c), and
(d). Let $\mathcal{H}$ be the collection of relations

$$\{\{\lambda\}\}\cup\{T : \exists a_1,\dots,a_j\in A,\ \mathrm{dom}(T)=\{a_1,\dots,a_j\},\ (a_1,\dots,a_j)\in(R_{U(T,a_1,\dots,a_j)})^{\mathbf{A}'}\}$$

We shall show that $\mathcal{H}$ is a supercomplete winning strategy for the Duplicator for the
$(j,k)$-pebble-relation game on $\mathbf{A}$ and $\mathbf{B}$. It is immediate to show that $\mathcal{H}$ satisfies conditions
(a), (b) and (d) of the definition of a supercomplete winning strategy. For condition (c), let
$I\subseteq A$ with $|I|\le k$, let $T$ be a relation in $\mathcal{H}$ with domain $I'\subseteq I$ and let $I''\subseteq I$ with
$|I''|\le j$. We have to show that $T'=\{h_{|I''} : h\in\hom(\mathbf{A}_{|I},\mathbf{B}),\ h_{|I'}\in T\}$ belongs to $\mathcal{H}$. We
will do a case analysis.

First assume that $I''=\emptyset$. In this case, since property (b) of the definition of supercomplete
winning strategy is satisfied we have that $T'=\{\lambda\}$, which belongs to $\mathcal{H}$. Now
assume that $I''\ne\emptyset$, say $I=\{a_1,\dots,a_k\}$ (here $a_1,\dots,a_k$ are not necessarily different) and
$I''=\{a_{l_1},\dots,a_{l_j}\}$.

• First consider the case $I'=\emptyset$. Thus $T=\{\lambda\}$ and $T'=\{h_{|I''} : h\in\hom(\mathbf{A}_{|I},\mathbf{B})\}$.
From condition (b) we have that $(a_{l_1},\dots,a_{l_j})\in(R_U)^{\mathbf{A}'}$ where

$$U=\{(h(a_{l_1}),\dots,h(a_{l_j})) : h\in\hom(\mathbf{A}_{|\{a_1,\dots,a_k\}},\mathbf{B})\}=U(T',a_{l_1},\dots,a_{l_j}),$$

and thus $T'\in\mathcal{H}$.

• The case $I'\ne\emptyset$ is proven similarly. In this case we have that there exist some
$a_{i_1},\dots,a_{i_j}\in I$ such that $I'=\{a_{i_1},\dots,a_{i_j}\}$ and $(a_{i_1},\dots,a_{i_j})\in(R_{U(T,a_{i_1},\dots,a_{i_j})})^{\mathbf{A}'}$.
Thus, $T'=\{h_{|I''} : h\in\hom(\mathbf{A}_{|I},\mathbf{B}),\ h_{|I'}\in T\}$ is identical to

$$\{h_{|\{a_{l_1},\dots,a_{l_j}\}} : h\in\hom(\mathbf{A}_{|\{a_1,\dots,a_k\}},\mathbf{B}),\ (h(a_{i_1}),\dots,h(a_{i_j}))\in U(T,a_{i_1},\dots,a_{i_j})\}$$

From condition (c) of relational winning strategy we have that $(a_{l_1},\dots,a_{l_j})\in(R_U)^{\mathbf{A}'}$ where

$$U=\{(h(a_{l_1}),\dots,h(a_{l_j})) : h\in\hom(\mathbf{A}_{|\{a_1,\dots,a_k\}},\mathbf{B}),\ (h(a_{i_1}),\dots,h(a_{i_j}))\in U(T,a_{i_1},\dots,a_{i_j})\}$$

which is equal to $U(T',a_{l_1},\dots,a_{l_j})$; hence, $T'\in\mathcal{H}$.

□

After the equivalence of supercomplete winning strategies and relational winning strategies
has been established, we shall construct a sentence $\varphi$ over the vocabulary $\tau\cup\{=\}$ that
tests whether such a structure $\mathbf{A}'$ (certifying the existence of a relational winning strategy)
exists. This is our third (and final) component of the proof of Theorem 3.

Claim 3. For every $\mathbf{B}$ there exists a sentence $\varphi$ over the vocabulary $\tau\cup\{=\}$ such that
for every $\tau$-structure $\mathbf{A}$, $\mathbf{A}\models\varphi$ iff the Duplicator has a relational winning strategy for the
$(j,k)$-PR game on $\mathbf{A}$ and $\mathbf{B}$.

Proof. The sentence $\varphi$ has a second-order predicate $R_{U_i}$ of arity $j$ for every $j$-ary relation
$U_i$, $1\le i\le m$, over $B$. Thus

$$\varphi=\exists R_{U_1},\dots,R_{U_m}\,\forall v_1,\dots,v_k\,\psi(v_1,\dots,v_k),$$

where $\psi(v_1,\dots,v_k)$ is a first-order formula over the vocabulary $\tau\cup\{=,R_{U_1},\dots,R_{U_m}\}$ that
is quantifier-free and has variables among $v_1,\dots,v_k$. Let us describe $\psi$. The formula $\psi$
will be a first-order formula in conjunctive normal form. First we need some auxiliary
definitions. Let $\tau'$ be $\tau\cup\{=,R_{U_1},\dots,R_{U_m}\}$ and let $\mathbf{B}'$ be the $\tau'$-structure with universe $B$
such that for every $R\in\tau$, $R^{\mathbf{B}'}=R^{\mathbf{B}}$, $(=)^{\mathbf{B}'}=\{(b,b) : b\in B\}$, and such that for every
$1\le i\le m$, $(R_{U_i})^{\mathbf{B}'}=U_i$.

A disjunctive formula $\gamma(v_1,\dots,v_k)$ is any first-order quantifier-free formula obtained as
the disjunction of some (possibly negated) predicates in $\tau'$ applied to variables in $v_1,\dots,v_k$.
$\gamma$ is monotone if every occurrence of a predicate in $\tau'$ is negated.

Let $\gamma(v_1,\dots,v_k)$ be any disjunctive monotone formula over the vocabulary $\tau'$ with
variables among $v_1,\dots,v_k$ and let $1\le i_1,\dots,i_j\le k$ be a collection of indices. We define
$U(\gamma,i_1,\dots,i_j)$ as the $j$-ary relation over $B$ defined by

$$\{(b_{i_1},\dots,b_{i_j}) : \mathbf{B}',b_1,\dots,b_k\not\models\gamma(v_1,\dots,v_k)\}$$

We are now in a position to describe $\psi(v_1,\dots,v_k)$. The formula $\psi$ is of the form $\bigwedge\Psi$ with
$\Psi=\Psi_1\cup\Psi_2\cup\Psi_3$ where:

• $\Psi_1$ contains the formula $\neg R_{\emptyset}(v_{l_1},\dots,v_{l_j})$ for every $1\le l_1,\dots,l_j\le k$.

• $\Psi_2$ contains, for every disjunctive monotone formula $\gamma$ over the vocabulary $\tau'$ with
at most one occurrence of a second-order predicate and every collection of indices
$1\le i_1,\dots,i_j\le k$, the formula (clause) $\gamma(v_1,\dots,v_k)\vee R_{U(\gamma,i_1,\dots,i_j)}(v_{i_1},\dots,v_{i_j})$. Notice
that such a clause is monotone and restricted Krom, although it might contain
equalities.

• $\Psi_3$ contains, for every $1\le n,n'\le m$ such that $U_n\subseteq U_{n'}$, and for every $1\le l_1,\dots,l_j\le k$,
the formula $\neg R_{U_n}(v_{l_1},\dots,v_{l_j})\vee R_{U_{n'}}(v_{l_1},\dots,v_{l_j})$.

Informally, each one of the subsets $\Psi_1,\Psi_2,\Psi_3$ of $\Psi$ encodes a condition of the definition
of relational winning strategy. Indeed, it is not difficult to see that $\bigwedge\Psi_1$ is equivalent
to condition (a) of relational winning strategy and that $\bigwedge\Psi_3$ formulates condition (d) of
relational winning strategy. It is also possible to see, although this case is certainly more
complicated, that $\bigwedge\Psi_2$ encodes exactly conditions (b) and (c) of relational winning strategy.
The intuition here is that the collection of all formulas $\gamma(v_1,\dots,v_k)\vee R_{U(\gamma,i_1,\dots,i_j)}(v_{i_1},\dots,v_{i_j})$
in $\Psi_2$ where $\gamma$ does not contain a second-order predicate encodes (b), whereas the set of all
such formulas with $\gamma$ containing one occurrence of a second-order predicate encodes (c).

In the following we shall make all this more precise.
Let $\mathbf{A}'$ be a $\tau'$-structure. We shall show that $\mathbf{A}'\models\forall v_1,\dots,v_k\,\psi(v_1,\dots,v_k)$ iff $\mathbf{A}''=\mathbf{A}'[\{R_{U_n} : 1\le n\le m\}]$
is a relational winning strategy for the Duplicator for the $(j,k)$-PR
game on $\mathbf{A}=\mathbf{A}'[\tau]$ and $\mathbf{B}$. It is easy to observe that this implies our claim.

First, assume that $\mathbf{A}'\models\forall v_1,\dots,v_k\,\psi(v_1,\dots,v_k)$. Thus, for every $a_1,\dots,a_k$ in $A$,
$\mathbf{A}',a_1,\dots,a_k\models\psi(v_1,\dots,v_k)=\bigwedge\Psi$. Since for every $1\le l_1,\dots,l_j\le k$, $\neg R_{\emptyset}(v_{l_1},\dots,v_{l_j})$
is in $\Psi$ we have that $(a_{l_1},\dots,a_{l_j})\notin(R_{\emptyset})^{\mathbf{A}''}$. Thus $(R_{\emptyset})^{\mathbf{A}''}=\emptyset$ and, in consequence, $\mathbf{A}''$
satisfies condition (a) of the definition of relational winning strategy.

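We shall show that $\mathbf{A}''$ satisfies (b). Let $a_1,\dots,a_k$ be (not necessarily different) elements
of $A$ and let $1\le l_1,\dots,l_j\le k$. Let $\theta(v_1,\dots,v_k)=\Theta(\mathbf{A},a_1,\dots,a_k)(v_1,\dots,v_k)$.

Thus, we have $\mathbf{A}',a_1,\dots,a_k\models\neg\theta(v_1,\dots,v_k)\vee R_{U(\neg\theta,l_1,\dots,l_j)}(v_{l_1},\dots,v_{l_j})$. Since, by
Proposition [?], $\mathbf{A}',a_1,\dots,a_k\not\models\neg\theta(v_1,\dots,v_k)$ we have that $(a_{l_1},\dots,a_{l_j})\in(R_{U(\neg\theta,l_1,\dots,l_j)})^{\mathbf{A}''}$.
Finally we have:

$$U(\neg\theta,l_1,\dots,l_j)=\{(b_{l_1},\dots,b_{l_j}) : \mathbf{B}',b_1,\dots,b_k\not\models\neg\theta(v_1,\dots,v_k)\}$$
$$=\{(b_{l_1},\dots,b_{l_j}) : \mathbf{A}_{|\{a_1,\dots,a_k\}},a_1,\dots,a_k\rightarrow\mathbf{B}',b_1,\dots,b_k\}$$
$$=\{(h(a_{l_1}),\dots,h(a_{l_j})) : h\in\hom(\mathbf{A}_{|\{a_1,\dots,a_k\}},\mathbf{B})\}$$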

We shall show that $\mathbf{A}''$ satisfies condition (c). Let $a_1,\dots,a_k$ be (not necessarily different)
elements of $A$, and let $\theta$ be defined as above. Let $R_{U_n}$ be a relation symbol, let
$(a_{i_1},\dots,a_{i_j})\in(R_{U_n})^{\mathbf{A}''}$ and let $1\le l_1,\dots,l_j\le k$. Let $\gamma(v_1,\dots,v_k)$ be the disjunctive
monotone formula defined by $\gamma(v_1,\dots,v_k)=\neg\theta(v_1,\dots,v_k)\vee\neg R_{U_n}(v_{i_1},\dots,v_{i_j})$. Therefore
the formula $\gamma(v_1,\dots,v_k)\vee R_{U(\gamma,l_1,\dots,l_j)}(v_{l_1},\dots,v_{l_j})$ is in $\Psi$. Thus,
$\mathbf{A}',a_1,\dots,a_k\models R_{U(\gamma,l_1,\dots,l_j)}(v_{l_1},\dots,v_{l_j})$ and in consequence $(a_{l_1},\dots,a_{l_j})\in(R_{U(\gamma,l_1,\dots,l_j)})^{\mathbf{A}''}$. Finally we
have:

$$U(\gamma,l_1,\dots,l_j)=\{(b_{l_1},\dots,b_{l_j}) : \mathbf{B}',b_1,\dots,b_k\not\models\gamma(v_1,\dots,v_k)\}$$
$$=\{(b_{l_1},\dots,b_{l_j}) : \mathbf{B}',b_1,\dots,b_k\models\theta(v_1,\dots,v_k),\ (b_{i_1},\dots,b_{i_j})\in(R_{U_n})^{\mathbf{B}'}=U_n\}$$
$$=\{(h(a_{l_1}),\dots,h(a_{l_j})) : h\in\hom(\mathbf{A}_{|\{a_1,\dots,a_k\}},\mathbf{B}),\ (h(a_{i_1}),\dots,h(a_{i_j}))\in U_n\}$$

We shall show that $\mathbf{A}''$ satisfies condition (d). Let $1\le n,n'\le m$ such that $U_n\subseteq U_{n'}$, and let
$(a_{l_1},\dots,a_{l_j})\in(R_{U_n})^{\mathbf{A}''}$. Since $\neg R_{U_n}(v_{l_1},\dots,v_{l_j})\vee R_{U_{n'}}(v_{l_1},\dots,v_{l_j})$ is in $\Psi_3$ we have that
$\mathbf{A}',a_1,\dots,a_k\models\neg R_{U_n}(v_{l_1},\dots,v_{l_j})\vee R_{U_{n'}}(v_{l_1},\dots,v_{l_j})$ and, in consequence,
$(a_{l_1},\dots,a_{l_j})\in(R_{U_{n'}})^{\mathbf{A}''}$.

For the converse, assume that $\mathbf{A}'$ is a relational winning strategy. We shall show that
for all $a_1, \dots, a_k$ (not necessarily different) elements in $A$ and for every $\chi(v_1, \dots, v_k)$ in $\Psi$,
$\mathbf{A}', a_1, \dots, a_k \models \chi(v_1, \dots, v_k)$. First, if $\chi \in \Psi_1$ then $\chi = \neg R_\emptyset(v_{l_1}, \dots, v_{l_j})$ for some
$1 \le l_1, \dots, l_j \le k$. In this case $\mathbf{A}', a_1, \dots, a_k \models \chi(v_1, \dots, v_k)$ since $(a_{l_1}, \dots, a_{l_j}) \notin (R_\emptyset)^{\mathbf{A}'} = \emptyset$.
Secondly, if $\chi \in \Psi_2$ then we have $\chi(v_1, \dots, v_k) = \gamma(v_1, \dots, v_k) \vee R_{U(\gamma, l_1, \dots, l_j)}(v_{l_1}, \dots, v_{l_j})$
where $\gamma(v_1, \dots, v_k)$ is a disjunctive monotone formula with at most one occurrence of a
predicate in $\{R_{U_n} : 1 \le n \le m\}$. Assume first that $\gamma(v_1, \dots, v_k)$ is a formula over $\tau \cup \{=\}$
(that is, it does not contain a second-order predicate). Suppose that $\mathbf{A}, a_1, \dots, a_k \not\models \gamma(v_1, \dots, v_k)$,
since otherwise $\chi$ is trivially satisfied. Let $\theta(v_1, \dots, v_k) = \theta(\mathbf{A}, a_1, \dots, a_k)(v_1, \dots, v_k)$. Then, by
Proposition [?], $\theta(v_1, \dots, v_k)$ implies $\neg\gamma(v_1, \dots, v_k)$.









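Thus

$$
\begin{aligned}
U(\gamma, l_1, \dots, l_j) &= \{(b_{l_1}, \dots, b_{l_j}) : \mathbf{B}', b_1, \dots, b_k \not\models \gamma(v_1, \dots, v_k)\} \\
&\supseteq \{(b_{l_1}, \dots, b_{l_j}) : \mathbf{B}', b_1, \dots, b_k \models \theta(v_1, \dots, v_k)\} \\
&= \{(b_{l_1}, \dots, b_{l_j}) : \mathbf{A}|_{\{a_1, \dots, a_k\}}, a_1, \dots, a_k \to \mathbf{B}, b_1, \dots, b_k\} \\
&= \{(h(a_{l_1}), \dots, h(a_{l_j})) : h \in \hom(\mathbf{A}|_{\{a_1, \dots, a_k\}}, \mathbf{B})\}
\end{aligned}
$$

Let $U$ be $\{(h(a_{l_1}), \dots, h(a_{l_j})) : h \in \hom(\mathbf{A}|_{\{a_1, \dots, a_k\}}, \mathbf{B})\}$. Since $\mathbf{A}'$ is a relational
winning strategy, $(a_{l_1}, \dots, a_{l_j}) \in (R_U)^{\mathbf{A}'}$. Furthermore, since $U \subseteq U(\gamma, l_1, \dots, l_j)$ we
have that $(a_{l_1}, \dots, a_{l_j}) \in (R_{U(\gamma, l_1, \dots, l_j)})^{\mathbf{A}'}$.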

The proof when $\gamma$ contains a predicate in $\{R_{U_n} : 1 \le n \le m\}$ is analogous. Finally it is
straightforward to verify the case $\chi \in \Psi_3$ using condition (d) of relational winning strategy.

□ 

Finally, Theorem 3 follows from the claims above. □

Restricted Krom SNP formulas can alternatively be regarded as a particular type of
Datalog programs, called linear Datalog Programs.

Let $\tau$ be a vocabulary. A Datalog Program over $\tau$ is a finite set of rules of the form

$$t_0 \text{ :- } t_1, \dots, t_m$$

where each $t_i$ is an atomic formula $R(v_1, \dots, v_m)$. The relational predicates that occur in
the heads of the rules are the intensional database predicates (IDBs) and do not belong to
$\tau$, while all others are the extensional database predicates (EDBs) and must belong to $\tau$.
One of the IDBs is designated as the goal of the program. Note that IDBs may occur in the
bodies of rules, and, thus, a Datalog program is a recursive specification of the IDBs with
semantics obtained via least fixed-points of monotone operators (see [36]).

A Datalog Program is called linear if every rule contains at most one occurrence of an
IDB in its body. Let $0 \le j \le k$ be non-negative integers. Then $(j, k)$-Datalog is the
collection of all Datalog programs in which every rule has at most $k$ variables and at most
$j$ variables in the head.
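The two syntactic restrictions just defined can be checked mechanically. The following is a minimal sketch, assuming a hypothetical representation of rules as `Rule(head, body)` objects built from `Atom(pred, args)`; the class and function names and the two example programs are illustrative and do not come from the paper.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Atom:
    pred: str    # predicate name
    args: tuple  # variable names


@dataclass(frozen=True)
class Rule:
    head: Atom
    body: tuple  # tuple of Atom


def idbs(program):
    """IDBs are exactly the predicates that occur in rule heads."""
    return {rule.head.pred for rule in program}


def is_linear(program):
    """Linear: every rule body contains at most one IDB occurrence."""
    intensional = idbs(program)
    return all(
        sum(atom.pred in intensional for atom in rule.body) <= 1
        for rule in program
    )


def in_jk_datalog(program, j, k):
    """(j, k)-Datalog: at most k variables per rule, at most j in the head."""
    for rule in program:
        head_vars = set(rule.head.args)
        all_vars = head_vars.union(*(atom.args for atom in rule.body))
        if len(head_vars) > j or len(all_vars) > k:
            return False
    return True


# Hypothetical example: reachability from a source S to a target F.
#   T(x) :- S(x).   T(y) :- T(x), E(x, y).   Goal :- T(x), F(x).
REACH = [
    Rule(Atom("T", ("x",)), (Atom("S", ("x",)),)),
    Rule(Atom("T", ("y",)), (Atom("T", ("x",)), Atom("E", ("x", "y")))),
    Rule(Atom("Goal", ()), (Atom("T", ("x",)), Atom("F", ("x",)))),
]

# Transitive closure written with two IDB atoms in one body: not linear.
NONLINEAR = [
    Rule(Atom("P", ("x", "y")), (Atom("E", ("x", "y")),)),
    Rule(Atom("P", ("x", "z")), (Atom("P", ("x", "y")), Atom("P", ("y", "z")))),
]
```

Under these assumptions `REACH` is a linear (1, 2)-Datalog program, while `NONLINEAR` fits within the (2, 3) variable bounds but fails the linearity test.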

Let us introduce the semantics of Datalog Programs in a bit more detail.

Let $Q$ be a Datalog Program over $\tau$, let $\tau_{IDB}$ be the set of its intensional predicates,
and let $\tau' = \tau \cup \tau_{IDB}$. The Datalog Program $Q$ defines an operator $\Phi : STR[\tau'] \to STR[\tau']$
in the class of $\tau'$-structures in the following way: for every $\tau'$-structure $\mathbf{A}$, $\Phi(\mathbf{A})$ is defined
to be the smallest $\tau'$-structure with the same universe $A$ as $\mathbf{A}$ such that:

• for every $R \in \tau'$, $R^{\mathbf{A}} \subseteq R^{\Phi(\mathbf{A})}$;

• for every rule $t_0 \text{ :- } t_1, \dots, t_m$ in $Q$, with variables $u_1, \dots, u_n$, and for every
interpretation $h : \{u_1, \dots, u_n\} \to A$ such that

$$\mathbf{A}, h(u_1), \dots, h(u_n) \models t_1 \wedge \dots \wedge t_m,$$

we have that $(h(v_1), \dots, h(v_r)) \in P^{\Phi(\mathbf{A})}$, where $t_0 = P(v_1, \dots, v_r)$.

Since $\Phi$ is a monotone operator we can define its minimum fixed point $\Phi^* : STR[\tau'] \to STR[\tau']$ as:

$$\Phi^*(\mathbf{A}) = \bigcup_{n \ge 0} \Phi^n(\mathbf{A}), \quad \text{for every } \mathbf{A},$$

where $\Phi^0(\mathbf{A}) = \mathbf{A}$ and $\Phi^{n+1}(\mathbf{A}) = \Phi(\Phi^n(\mathbf{A}))$, $n \ge 0$. Alternatively, $\Phi^*(\mathbf{A})$ can be defined
as the smallest $\tau'$-structure with universe $A$ such that $\mathbf{A} \subseteq \Phi^*(\mathbf{A})$ and $\Phi(\Phi^*(\mathbf{A})) = \Phi^*(\mathbf{A})$.
A distinguished IDB predicate $P$ is designated to be the goal of the program. A Datalog
program is a query that, given a $\tau$-structure $\mathbf{A}$, returns $P^{\Phi^*(\mathbf{A}')}$, where $\mathbf{A}'$ is the $\tau'$-structure
with the same universe as $\mathbf{A}$ and such that for all $R \in \tau$, $R^{\mathbf{A}'} = R^{\mathbf{A}}$ and for all $R \in \tau_{IDB}$,
$R^{\mathbf{A}'} = \emptyset$. We say that a structure $\mathbf{A}$ is accepted by the Datalog Program $Q$ iff $P^{\Phi^*(\mathbf{A}')} \ne \emptyset$.
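The least-fixed-point semantics and the acceptance condition can be made concrete with a naive evaluator. This is only a sketch under stated assumptions: rules are encoded as `(head, body)` pairs with atoms written as `(predicate, variables)`, the function names `fixpoint` and `accepts` are hypothetical, and the brute-force enumeration of interpretations is chosen for clarity rather than efficiency.

```python
from itertools import product


def fixpoint(universe, edb, rules):
    """Compute the relations of Phi*(A') by iterating Phi from A'.

    universe: iterable of elements of A
    edb:      dict mapping each EDB name to a set of tuples
    rules:    list of (head, body); every atom is a pair (pred, vars)
    """
    universe = list(universe)
    idb_names = {head[0] for head, _ in rules}
    # A': EDBs as given, every IDB initially empty.
    rel = {name: set(tuples) for name, tuples in edb.items()}
    for name in idb_names:
        rel[name] = set()
    changed = True
    while changed:
        changed = False
        derived = []
        # One application of Phi: all derivations use the current relations.
        for (hpred, hvars), body in rules:
            variables = sorted({v for _, vs in body for v in vs} | set(hvars))
            for values in product(universe, repeat=len(variables)):
                h = dict(zip(variables, values))
                if all(tuple(h[v] for v in vs) in rel.get(p, set())
                       for p, vs in body):
                    derived.append((hpred, tuple(h[v] for v in hvars)))
        for pred, tup in derived:
            if tup not in rel[pred]:
                rel[pred].add(tup)
                changed = True
    return rel


def accepts(universe, edb, rules, goal):
    """A is accepted iff the goal relation is non-empty in Phi*(A')."""
    return len(fixpoint(universe, edb, rules)[goal]) > 0


# Hypothetical linear program: is some F-element reachable from an S-element?
RULES = [
    (("T", ("x",)), [("S", ("x",))]),
    (("T", ("y",)), [("T", ("x",)), ("E", ("x", "y"))]),
    (("Goal", ()), [("T", ("x",)), ("F", ("x",))]),
]
EDB = {"E": {(0, 1), (1, 2)}, "S": {(0,)}, "F": {(2,)}}
```

A 0-ary goal is represented by the empty tuple, so "non-empty" means that the goal relation contains `()`.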

Let $\mathcal{C}$ be a set of $\tau$-structures and let $Q$ be a Datalog Program. We say that $\mathcal{C}$ is defined
by $Q$ if for every $\tau$-structure $\mathbf{A}$, $\mathbf{A}$ is in $\mathcal{C}$ iff $\mathbf{A}$ is accepted by $Q$. It is well known and easy
to see that if $\mathcal{C}$ is defined by a Datalog Program then $\mathcal{C}$ must be a filter.

The following result relates linear Datalog programs with infinitary logics. 

Theorem 4. Let $0 \le j \le k$ be non-negative integers, and let $\mathcal{C}$ be a set of $\tau$-structures
for some vocabulary $\tau$. If $\mathcal{C}$ is definable by a linear $(j, k)$-Datalog Program then it is also
definable in $M^{j,k}$.

Proof. Let $Q$ be a linear $(j, k)$-Datalog Program. We will show that: (*) for every
$\tau$-structure $\mathbf{A}$, for every intensional predicate $R \in \tau_{IDB}$, for every $n \ge 0$ and for every
$(a_1, \dots, a_r) \in R^{\Phi^n(\mathbf{A}')}$, there exists some structure $\mathbf{B}$, some $b_1, \dots, b_r \in B$, some
path-decomposition $S_1, \dots, S_s$ of width $(j, k)$ of $\mathbf{B}$ with $\{b_1, \dots, b_r\} \subseteq S_1$, such that
$(b_1, \dots, b_r) \in R^{\Phi^n(\mathbf{B})}$, and $\mathbf{B}, b_1, \dots, b_r \to \mathbf{A}, a_1, \dots, a_r$. We will prove statement (*) by induction on $n$.
The statement is vacuously true for $n = 0$. We will show that (*) holds for $n + 1$ whenever
it holds for $n$. Let $R \in \tau_{IDB}$ be an intensional predicate and let $(a_1, \dots, a_r)$ be any tuple in
$R^{\Phi^{n+1}(\mathbf{A}')}$. If $(a_1, \dots, a_r)$ is in $R^{\Phi^n(\mathbf{A}')}$ then we are done. Otherwise there exists some rule
$R(y_1, \dots, y_r) \text{ :- } t_1, \dots, t_m$ in $Q$, over the variables $u_1, \dots, u_{k'}$, $k' \le k$, and some mapping
$h : \{u_1, \dots, u_{k'}\} \to A$ such that $\Phi^n(\mathbf{A}'), h(u_1), \dots, h(u_{k'}) \models t_1 \wedge \dots \wedge t_m$, and $a_i = h(y_i)$
for every $1 \le i \le r$. Let us consider two cases. If the body of the rule does not contain any
intensional predicate, then $\mathbf{A}, h(u_1), \dots, h(u_{k'}) \models t_1 \wedge \dots \wedge t_m$. Then $\mathbf{A}|_{\{h(u_1), \dots, h(u_{k'})\}}$ and
$h(y_1), \dots, h(y_r)$ satisfy statement (*). Otherwise, let us assume that the body contains
one occurrence of an intensional predicate, say $t_1 = R_1(x_1, \dots, x_l)$. In this case, since

$$(h(x_1), \dots, h(x_l)) \in R_1^{\Phi^n(\mathbf{A}')},$$

we can assume, by induction hypothesis, that there exists some $\mathbf{B}$ and some $b_1, \dots, b_l$ in
$B$, such that $(b_1, \dots, b_l) \in R_1^{\Phi^n(\mathbf{B})}$ and $\mathbf{B}, b_1, \dots, b_l \to \mathbf{A}, h(x_1), \dots, h(x_l)$. Furthermore
there exists a path-decomposition $S_1, \dots, S_s$ of $\mathbf{B}$ such that $\{b_1, \dots, b_l\} \subseteq S_1$. We can
assume, by renaming elements of $B$ if necessary, that $b_i = h(x_i)$ for every $1 \le i \le l$ and
that every other element of the universe of $\mathbf{B}$ does not belong to $A$. Consider the structure
$\mathbf{C}$ given as $\mathbf{B} \cup \mathbf{A}|_{\{h(u_1), \dots, h(u_{k'})\}}$. By construction, $\Phi^n(\mathbf{C}), h(u_1), \dots, h(u_{k'}) \models t_1 \wedge \dots \wedge t_m$
and consequently, $(h(y_1), \dots, h(y_r)) \in R^{\Phi^{n+1}(\mathbf{C})}$.

Furthermore $\{h(u_1), \dots, h(u_{k'})\}, S_1, \dots, S_s$ is a path-decomposition of $\mathbf{C}$ of width $(j, k)$.

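In order to finish the proof, let $\mathcal{O}$ be the set containing all $\tau$-structures of pathwidth
$(j, k)$ in $\mathcal{C}$. We shall see that $\mathcal{O}$ is an obstruction set of $\neg\mathcal{C}$. First, as $\mathcal{O}$ only contains
structures in $\mathcal{C}$ and $\mathcal{C}$ is a filter, we can infer that for every structure $\mathbf{A}$ such that $\mathbf{B} \to \mathbf{A}$
for some $\mathbf{B}$ in $\mathcal{O}$, $\mathbf{A}$ is in $\mathcal{C}$. Conversely, let $\mathbf{A}$ be any structure in $\mathcal{C}$. By the definition of
acceptance by a Datalog Program, we can conclude that $P^{\Phi^*(\mathbf{A}')} \ne \emptyset$. By (*) we have that
there exists some $\tau$-structure $\mathbf{B}$ with pathwidth at most $(j, k)$ such that $P^{\Phi^*(\mathbf{B})} \ne \emptyset$ (and
hence $\mathbf{B} \in \mathcal{O}$), and $\mathbf{B} \to \mathbf{A}$.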

□ 

Lemma 1. Let $0 \le j \le k$ be non-negative integers and let $\mathcal{C}$ be a collection of $\tau$-structures.
The following two statements are equivalent:

(1) $\mathcal{C}$ is definable in linear $(j, k)$-Datalog.

(2) $\neg\mathcal{C}$ is definable in $j$-adic $k$-ary restricted Krom monotone SNP.

Furthermore, if $\neg\mathcal{C}$ is an ideal we also have that (1) and (2) are equivalent to:

(3) $\neg\mathcal{C}$ is definable in $j$-adic $k$-ary restricted Krom SNP with equalities.

Proof. Showing the equivalence between (1) and (2) is straightforward but laborious. Here
we will only sketch the proof briefly. To prove (1)$\to$(2) we show that for any linear
$(j, k)$-Datalog program $Q$, there exists a sentence $\varphi$ in $j$-adic $k$-ary restricted Krom monotone
SNP, such that for every structure $\mathbf{A}$, $\mathbf{A} \models \varphi$ if and only if $Q$ does not accept $\mathbf{A}$.
The sentence $\varphi$ is of the form $\exists R_1, \dots, R_l\, \forall v_1, \dots, v_k\, \psi(v_1, \dots, v_k)$ where:

• $R_1, \dots, R_l$ are the IDBs of the Datalog Program $Q$.

• $v_1, \dots, v_k$ are the variables occurring in the rules of the Datalog Program. Note:
we can assume that the variables in the Datalog Program have been renamed so
that every rule has its variables among $v_1, \dots, v_k$.

• $\psi(v_1, \dots, v_k)$ is a CNF formula that is defined as

$$\psi(v_1, \dots, v_k) = \neg P \wedge \bigwedge_{t_0 \text{ :- } t_1, \dots, t_m \,\in\, Q} \left(t_0 \vee \neg t_1 \vee \dots \vee \neg t_m\right)$$

where $P$ is the goal predicate of $Q$.

It is immediate, although again laborious, to show that $\varphi$ is a sentence in $j$-adic $k$-ary
restricted Krom monotone SNP that is satisfied precisely by those structures that are not
accepted by $Q$.

For the converse ((2)$\to$(1)), let $\varphi = \exists R_1, \dots, R_l\, \forall v_1, \dots, v_k\, \psi(v_1, \dots, v_k)$ be an arbitrary
sentence in $j$-adic $k$-ary restricted Krom monotone SNP. We shall construct a linear
$(j, k)$-Datalog Program $Q$ in the following way:

• The IDBs of $Q$ are precisely the second-order predicates of $\varphi$, $R_1, \dots, R_l$, plus a new
0-ary IDB, $P$, which will act as the goal predicate.

• The rules of $Q$ are constructed from $\psi$ in the following way. The formula $\psi$ is a
CNF and hence is the conjunction of several subformulas, each of which is a
disjunction of atomic or negated atomic formulas. In fact, since we are dealing
with a restricted Krom monotone SNP formula, each of these subformulas has to be
of a very restricted form. More precisely, every one of the subformulas that
constitute $\psi$ has to be either a disjunction of negated atomic formulas
$\neg t_1 \vee \dots \vee \neg t_m$, or it can contain one unnegated atomic formula, that is,
it has to be of the form $t_0 \vee \neg t_1 \vee \dots \vee \neg t_m$ where the underlying predicate of $t_0$ is
an IDB.

The Datalog Program $Q$ contains one rule for each subformula in $\psi$. If the subformula
is of the form $\neg t_1 \vee \dots \vee \neg t_m$ then the rule added to $Q$ is $P \text{ :- } t_1, \dots, t_m$. Otherwise,
if the subformula is of the form $t_0 \vee \neg t_1 \vee \dots \vee \neg t_m$ then the rule is
$t_0 \text{ :- } t_1, \dots, t_m$.

Again it is an easy exercise to show that the Datalog Program $Q$ defined is indeed a linear
$(j, k)$-Datalog Program that accepts precisely those structures that falsify $\varphi$.
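The clause-by-clause translation in the converse direction is simple enough to sketch in code. The encoding below is an assumption made for illustration: a clause is a list of `(is_positive, atom)` pairs with atoms written as `(predicate, variables)`, and `clause_to_rule` is a hypothetical helper name. The check on negated second-order atoms reflects the linearity requirement on the resulting rule.

```python
def clause_to_rule(clause, idbs, goal="P"):
    """Translate one clause of psi into a Datalog rule (head, body).

    clause: list of (is_positive, atom), with atom = (pred, vars)
    idbs:   names of the second-order predicates R_1, ..., R_l
    goal:   name of the fresh 0-ary goal predicate
    """
    positives = [atom for positive, atom in clause if positive]
    body = [atom for positive, atom in clause if not positive]
    # Monotone case: the only possible unnegated atom is a single IDB atom.
    if len(positives) > 1 or any(atom[0] not in idbs for atom in positives):
        raise ValueError("clause is not of the restricted form")
    # Linearity: at most one IDB atom may appear in the body.
    if sum(atom[0] in idbs for atom in body) > 1:
        raise ValueError("rule body would contain two IDB atoms")
    head = positives[0] if positives else (goal, ())
    return head, body


# Hypothetical sentence stating that no F-element is reachable from S:
#   (not S(x) or T(x)), (not T(x) or not E(x,y) or T(y)), (not T(x) or not F(x))
CLAUSES = [
    [(False, ("S", ("x",))), (True, ("T", ("x",)))],
    [(False, ("T", ("x",))), (False, ("E", ("x", "y"))), (True, ("T", ("y",)))],
    [(False, ("T", ("x",))), (False, ("F", ("x",)))],
]
PROGRAM = [clause_to_rule(c, idbs={"T"}) for c in CLAUSES]
```

With these inputs `PROGRAM` consists of the rules `T(x) :- S(x)`, `T(y) :- T(x), E(x, y)` and `P :- T(x), F(x)`, which accept exactly the structures falsifying the example sentence.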

To see that if $\mathcal{C}$ is a filter then (3) implies (1) we make use of a result by Feder and
Vardi [11] which states that for every Datalog($\ne$,$\neg$) Program $P$, that is, every Datalog
Program in which we allow inequality and the negation of EDBs, that defines a filter $\mathcal{C}$,
there exists a Datalog Program $P'$ that defines the same set of structures $\mathcal{C}$. Furthermore,
a closer inspection of the proof in [11] reveals that if $P$ is in linear $(j, k)$-Datalog($\ne$,$\neg$), then
$P'$ is in linear $(j, k)$-Datalog.

The proof that (3) implies (1) mimics that of (2) implies (1). As before let
$\varphi = \exists R_1, \dots, R_l\, \forall v_1, \dots, v_k\, \psi(v_1, \dots, v_k)$ be an arbitrary sentence in $j$-adic $k$-ary restricted Krom
SNP with equalities. We shall construct a linear $(j, k)$-Datalog Program $Q$ in a similar
fashion:

• The IDBs of $Q$ are precisely the second-order predicates of $\varphi$, $R_1, \dots, R_l$, plus a new
0-ary IDB, $P$, which will act as the goal predicate.

• The rules of $Q$ are constructed from $\psi$ in the following way. The formula $\psi$ is a CNF
and hence is the conjunction of several subformulas of the form $t_1 \vee \dots \vee t_m$,
where each $t_i$, $1 \le i \le m$, is an atomic or a negated atomic formula. Since the
formula $\psi$ is not supposed to be monotone and might contain equalities, we can only
assume that each one of the disjunctions $t_1 \vee \dots \vee t_m$ of $\psi$ has at most one
positive occurrence of a second-order variable and at most one negative occurrence
of a second-order variable. As before, we add to $Q$ one rule for each disjunction
$t_1 \vee \dots \vee t_m$ in $\psi$. If $t_1 \vee \dots \vee t_m$ does not contain any positive occurrence of an
IDB, then the rule added to $Q$ is $P \text{ :- } \neg t_1, \dots, \neg t_m$. Otherwise, if $t_1 \vee \dots \vee t_m$
contains one positive occurrence of an IDB, say $t_1$, then the rule added to $Q$ is
$t_1 \text{ :- } \neg t_2, \dots, \neg t_m$.

Observe that the body of a rule in $Q$ might contain equalities and negated atomic formulas
$\neg R(x_1, \dots, x_r)$ where $R$ is an EDB. Consequently, $Q$ is a Datalog($\ne$,$\neg$) Program. It is
easy to see that $Q$ is, indeed, a linear $(j, k)$-Datalog($\ne$,$\neg$) Program and that $Q$ accepts
precisely those structures that falsify $\varphi$. Since $\mathcal{C}$ is a filter, by the result in [11]
mentioned above there exists a linear $(j, k)$-Datalog Program (that is, without inequalities
and negations) that defines $\mathcal{C}$.

The implication ((2)$\to$(3)) is trivial. □

If we are dealing with a set of structures $\neg\mathcal{C}$ of the form $I(\mathbf{B})$ for some $\tau$-structure $\mathbf{B}$,
then we can combine the preceding theorems with Lemma 1, obtaining the main
result of this paper.

Theorem 5. Let $0 \le j \le k$ be non-negative integers, let $\mathbf{B}$ be a $\tau$-structure, and let
$\neg\mathcal{C} = \mathrm{CSP}(\mathbf{B}) = I(\mathbf{B})$. The following statements are equivalent:

(1) The class $\mathcal{C}$ is $M^{j,k}$-definable.

(2) The class $\mathcal{C}$ is $N^{j,k}$-definable.

(3) The class $\mathcal{C}$ is definable in linear $(j, k)$-Datalog.

(4) If $\mathbf{A}$ and $\mathbf{B}$ are finite structures such that $\mathbf{A} \in \mathcal{C}$ and Duplicator has a winning
strategy for the $(j, k)$-PR game on $\mathbf{A}$ and $\mathbf{B}$, then $\mathbf{B} \in \mathcal{C}$.

(5) The class $\neg\mathcal{C}$ has an obstruction set with pathwidth at most $(j, k)$.

(6) The class $\neg\mathcal{C}$ is definable in $j$-adic $k$-ary restricted Krom SNP with equalities.

(7) The class $\neg\mathcal{C}$ is definable in $j$-adic $k$-ary restricted Krom monotone SNP.

Let $\mathbf{B}$ be a $\tau$-structure. If $\neg\mathcal{C} = I(\mathbf{B})$ satisfies any of the conditions of Theorem 5 we
say that $\mathbf{B}$ has $(j, k)$-path duality. Finally, we say that $\mathbf{B}$ has bounded path duality if $\mathbf{B}$ has
$(j, k)$-path duality for some $0 \le j \le k$.
7. Applications in Computational Complexity

In this section we bring computational complexity into the picture. We shall start by 
proving that for every finite structure B with bounded path duality, CSP(B) is in NL. 
This result is an immediate consequence of the following theorem, proven in [14].

Theorem 6 ([14]). Let $\varphi$ be a Krom CNF formula over a vocabulary $\tau$. The problem of
deciding, given a $\tau$-structure $\mathbf{A}$, whether $\mathbf{A} \models \varphi$, is solvable in NL.

Indeed, if B has bounded path duality, then CSP(B) is definable in Krom SNP. Hence 
we have, 

Proposition 3. Let $\mathbf{B}$ be a $\tau$-structure with bounded path duality. Then $\mathrm{CSP}(\mathbf{B})$ is in
NL.

In what follows, we shall revisit all families of constraint satisfaction problems that,
to the best of our knowledge, are known to be in NL. Our goal here is to show that
the notion of bounded path duality provides a unifying framework that encompasses and
explains these results. In the last part of this section, we shall see how the notion of bounded
path duality can be used to place new problems in NL.

7.1. Implicational constraints. The class of implicational constraints was introduced
independently by Kirousis [2S1 and by Cooper et al. [I] (in this second reference they were
named 0/1/all constraints). A binary relation $R \subseteq A^2$ over a finite domain $A$ is called
implicational if it has one of the following three forms: (1) $B \times C$ for some $B, C \subseteq A$, (2)
$\{(a, f(a)) : a \in B\}$ for some $B \subseteq A$ and some one-to-one mapping $f : B \to A$, or (3)
$(\{b\} \times C) \cup (B \times \{c\})$ for some $B, C \subseteq A$, $b \in B$, and $c \in C$. A structure $\mathbf{B}$ is said to be
implicational if so are all its relations. It is easy to observe that 2-SAT can be encoded as
a constraint satisfaction problem $\mathrm{CSP}(\mathbf{B}_{2\text{-SAT}})$ with $\mathbf{B}_{2\text{-SAT}}$ implicational:

The signature $\tau$ of $\mathbf{B}_{2\text{-SAT}}$ contains three binary relation symbols $P_0$, $P_1$ and $P_2$. The
universe of $\mathbf{B}_{2\text{-SAT}}$ is $\{0, 1\}$ and the values of $P_0^{\mathbf{B}_{2\text{-SAT}}}$, $P_1^{\mathbf{B}_{2\text{-SAT}}}$ and $P_2^{\mathbf{B}_{2\text{-SAT}}}$ are respectively
$\{0, 1\}^2 \setminus \{(0, 0)\}$, $\{0, 1\}^2 \setminus \{(0, 1)\}$, and $\{0, 1\}^2 \setminus \{(1, 1)\}$. Observe that each one of these
relations is implicational.

It is well known and quite easy to see that 2-SAT and $\mathrm{CSP}(\mathbf{B}_{2\text{-SAT}})$ are logspace
reducible to each other. Indeed, let $\varphi$ be an arbitrary 2-CNF formula. We shall show that it
is possible to construct in logarithmic space a $\tau$-structure $\mathbf{A}$ such that $\mathbf{A}$ is homomorphic
to $\mathbf{B}_{2\text{-SAT}}$ if and only if $\varphi$ has a solution. The universe $A$ of $\mathbf{A}$ is given by the variables of
$\varphi$. Furthermore, $P_0^{\mathbf{A}}$ contains, for each clause $(x \vee y)$ with only positive literals, the tuple
$(x, y)$. Observe that every homomorphism from $\mathbf{A}$ to $\mathbf{B}_{2\text{-SAT}}$ should set values to $x$ and $y$ that
belong to $P_0^{\mathbf{B}_{2\text{-SAT}}} = \{0, 1\}^2 \setminus \{(0, 0)\}$. Hence the values set to $x$ and $y$ must satisfy clause
$(x \vee y)$.

A similar processing is applied to the other types of clauses. More precisely, for each
clause with exactly one positive literal $(x \vee \neg y)$ (let us assume that the literals are ordered
so that the positive literal comes first) we add the tuple $(x, y)$ to $P_1^{\mathbf{A}}$, and for each clause
$(\neg x \vee \neg y)$ with only negative literals we add the tuple $(x, y)$ to $P_2^{\mathbf{A}}$. It is very easy to verify
that satisfying assignments of $\varphi$ correspond to homomorphisms from $\mathbf{A}$ to $\mathbf{B}_{2\text{-SAT}}$ and that
this reduction can be performed in logarithmic space. The reduction from $\mathrm{CSP}(\mathbf{B}_{2\text{-SAT}})$ to
2-SAT is also very simple and we omit it.
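The reduction from 2-SAT can be sketched as follows. The literal encoding `(variable, is_positive)` and the function names are assumptions made for this illustration, and the homomorphism test is a brute-force check intended only to validate the construction on small inputs; it is not the logspace procedure.

```python
from itertools import product

ALL_PAIRS = set(product((0, 1), repeat=2))
# Relations of the template B_2-SAT as described above.
B_2SAT = {
    "P0": ALL_PAIRS - {(0, 0)},  # clause (x or y)
    "P1": ALL_PAIRS - {(0, 1)},  # clause (x or not y)
    "P2": ALL_PAIRS - {(1, 1)},  # clause (not x or not y)
}


def structure_from_2cnf(clauses):
    """Build the structure A from a list of 2-clauses.

    Each clause is a pair of literals; a literal is (variable, is_positive).
    """
    universe = set()
    rel = {"P0": set(), "P1": set(), "P2": set()}
    for (x, x_pos), (y, y_pos) in clauses:
        universe |= {x, y}
        if x_pos and y_pos:
            rel["P0"].add((x, y))
        elif x_pos:
            rel["P1"].add((x, y))
        elif y_pos:
            rel["P1"].add((y, x))  # reorder: positive literal first
        else:
            rel["P2"].add((x, y))
    return universe, rel


def homomorphic_to_b2sat(universe, rel):
    """Brute-force search for a homomorphism from A to B_2-SAT."""
    elems = sorted(universe)
    for values in product((0, 1), repeat=len(elems)):
        f = dict(zip(elems, values))
        if all((f[x], f[y]) in B_2SAT[name]
               for name, tuples in rel.items() for (x, y) in tuples):
            return True
    return False


SAT = [(("x", True), ("y", True)), (("x", False), ("y", True))]
UNSAT = SAT + [(("x", True), ("y", False)), (("x", False), ("y", False))]
```

Here `SAT` encodes $(x \vee y) \wedge (\neg x \vee y)$, and `UNSAT` adds the two remaining clauses over $x$ and $y$, which rules out every assignment.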

Let $\mathbf{B}$ be any implicational structure and let $\mathbf{A}$ be any instance of $\mathrm{CSP}(\mathbf{B})$. We
first recall a well-known result about constraint satisfaction problems $\mathrm{CSP}(\mathbf{B})$ with $\mathbf{B}$
implicational, which can be found in [23]. We define the
conflict graph $G$ associated to $\mathbf{A}$ and $\mathbf{B}$ as the digraph $G = (V, E)$, with set of nodes
$V = \{(a, b) : a \in A, b \in B\} \cup \{\Box\}$ and set of edges $E$ constructed in the following way:

(a) For every predicate $R$ in the vocabulary $\tau$ of $\mathbf{A}$, for every $b, b' \in B$ such that
$R^{\mathbf{B}} \cap (\{b\} \times B) = \{(b, b')\}$, and for every tuple $(a, a') \in R^{\mathbf{A}}$ we add to $E$ an arc from
$(a, b)$ to $(a', b')$.

(b) For every predicate $R$ in the vocabulary $\tau$ of $\mathbf{A}$, for every $b, b' \in B$ such that
$R^{\mathbf{B}} \cap (B \times \{b'\}) = \{(b, b')\}$, and for every tuple $(a, a') \in R^{\mathbf{A}}$ we add to $E$ an arc
from $(a', b')$ to $(a, b)$.

(c) For every predicate $R$ in the vocabulary $\tau$ of $\mathbf{A}$, for every $b \in B$ such that
$R^{\mathbf{B}} \cap (\{b\} \times B) = \emptyset$, and for every tuple $(a, a') \in R^{\mathbf{A}}$ we add to $E$ an arc from $(a, b)$ to $\Box$.

(d) For every predicate $R$ in the vocabulary $\tau$ of $\mathbf{A}$, for every $b' \in B$ such that
$R^{\mathbf{B}} \cap (B \times \{b'\}) = \emptyset$, and for every tuple $(a, a') \in R^{\mathbf{A}}$ we add to $E$ an arc from $(a', b')$ to
$\Box$.

Lemma 2. $\mathbf{A}$ is homomorphic to $\mathbf{B}$ iff for every $a \in A$ there exists some $b \in B$ such that
there is no path from $(a, b)$ to any node in $\{(a, b') : b' \in B \setminus \{b\}\} \cup \{\Box\}$.
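The conflict graph and the criterion of Lemma 2 can be sketched directly from rules (a) to (d). The code below is an illustration only: the names `conflict_graph`, `reachable`, `homomorphic_by_lemma2` and the sentinel `BOX` are hypothetical, relations are assumed to be binary, and the examples exercise the criterion only on the 2-SAT template.

```python
from collections import deque

BOX = "box"  # stands for the extra node of the conflict graph


def conflict_graph(rel_a, rel_b, dom_b):
    """Arcs of the conflict graph, following rules (a)-(d)."""
    arcs = {}

    def add(u, v):
        arcs.setdefault(u, set()).add(v)

    for name, tuples in rel_a.items():
        rb = rel_b[name]
        for b in dom_b:
            row = [t for t in rb if t[0] == b]  # R^B restricted to {b} x B
            col = [t for t in rb if t[1] == b]  # R^B restricted to B x {b}
            for (a, a2) in tuples:
                if len(row) == 1:
                    add((a, b), (a2, row[0][1]))  # rule (a)
                if len(col) == 1:
                    add((a2, b), (a, col[0][0]))  # rule (b)
                if not row:
                    add((a, b), BOX)              # rule (c)
                if not col:
                    add((a2, b), BOX)             # rule (d)
    return arcs


def reachable(arcs, start):
    """Set of nodes reachable from start (breadth-first search)."""
    seen, queue = {start}, deque([start])
    while queue:
        u = queue.popleft()
        for v in arcs.get(u, ()):
            if v not in seen:
                seen.add(v)
                queue.append(v)
    return seen


def homomorphic_by_lemma2(dom_a, rel_a, rel_b, dom_b):
    """Every a needs a value b whose node reaches no conflicting node."""
    arcs = conflict_graph(rel_a, rel_b, dom_b)
    for a in dom_a:
        good = False
        for b in dom_b:
            bad = {(a, b2) for b2 in dom_b if b2 != b} | {BOX}
            if not (reachable(arcs, (a, b)) & bad):
                good = True
                break
        if not good:
            return False
    return True


PAIRS = {(0, 0), (0, 1), (1, 0), (1, 1)}
B_2SAT = {"P0": PAIRS - {(0, 0)}, "P1": PAIRS - {(0, 1)}, "P2": PAIRS - {(1, 1)}}
# (x or y) and (not x or y): satisfiable.
A_SAT = {"P0": {("x", "y")}, "P1": {("y", "x")}, "P2": set()}
# All four clauses over x and y: unsatisfiable.
A_UNSAT = {"P0": {("x", "y")}, "P1": {("x", "y"), ("y", "x")}, "P2": {("x", "y")}}
```

On the 2-SAT template the graph built this way coincides with the usual implication graph, so the reachability test reduces to the familiar check that no variable is forced to both values.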

Let us see that this implies that each implicational $\mathbf{B}$ has $(2, 3)$-path duality.

Lemma 3. Every implicational structure $\mathbf{B}$ has $(2, 3)$-path duality.

Proof. Let $\mathbf{A}$ be any structure not homomorphic to $\mathbf{B}$ and let $G$ be the conflict graph
associated to $\mathbf{A}$ and $\mathbf{B}$. Let $\{b_1, \dots, b_m\}$ be the universe $B$ of $\mathbf{B}$. We shall construct a
structure $\mathbf{P}$ with pathwidth at most $(2, 3)$ that is homomorphic to $\mathbf{A}$ but is not homomorphic
to $\mathbf{B}$.

By Lemma 2 there exists some $a^* \in A$ such that for all $b_i$, $1 \le i \le m$, there is a path in
$G$ from $(a^*, b_i)$ to either $(a^*, b_j)$ for some $j \ne i$ or $\Box$. Let us denote by $(a^*, b_i) = v^i_1, v^i_2, \dots, v^i_{l_i}$
the elements of such a path. Observe that $v^i_{l_i}$ is either $(a^*, b_j)$ for some $j \ne i$ or $\Box$.

The universe $P$ of $\mathbf{P}$ contains, for each $1 \le i \le m$ and for each node $v^i_k$, $1 \le k \le l_i$,
in the path corresponding to $b_i$, an element $w^i_k$. If the same node appears in several
paths then we make different copies of it. Finally we merge the nodes $w^i_k$ corresponding to
$v^i_k = (a^*, b_l)$ for some $1 \le l \le m$ into a unique node that we shall call $w$. Observe that for
all $1 \le i \le m$, $w^i_1$ is $w$.

Now we shall construct the relations of $\mathbf{P}$ and, at the same time, a homomorphism $h$
from $\mathbf{P}$ to $\mathbf{A}$.

Let $i$ be any integer with $1 \le i \le m$ and let $v^i_j, v^i_{j+1}$ be any two consecutive elements
in the path corresponding to $b_i$. Then $(v^i_j, v^i_{j+1})$ is an arc of $G$ that must have been added
according to (a), (b), (c) or (d). If $(v^i_j, v^i_{j+1})$ has been added according to (a) then there
exists some predicate $R$ in $\tau$, some $b, b' \in B$ such that $R^{\mathbf{B}} \cap (\{b\} \times B) = \{(b, b')\}$, and some
tuple $(a, a') \in R^{\mathbf{A}}$. Then we add to $R^{\mathbf{P}}$ the tuple $(w^i_j, w^i_{j+1})$. The mapping $h$ maps $w^i_j$ to
$a$ and $w^i_{j+1}$ to $a'$. Similarly, if $(v^i_j, v^i_{j+1})$ has been added according to (b) then there exists
some predicate $R$ in $\tau$, some $b, b' \in B$ such that $R^{\mathbf{B}} \cap (B \times \{b'\}) = \{(b, b')\}$, and some tuple
$(a, a') \in R^{\mathbf{A}}$. In this case we add $(w^i_{j+1}, w^i_j)$ to $R^{\mathbf{P}}$ and set $h(w^i_j) = a'$ and $h(w^i_{j+1}) = a$. The
cases (c) and (d) are dealt with in a similar fashion.

By construction, $h$ defines a homomorphism from $\mathbf{P}$ to $\mathbf{A}$. We shall now prove that $\mathbf{P}$
is not homomorphic to $\mathbf{B}$. Towards a contradiction let us assume that there exists such a
homomorphism $f$ and let $b_i$ be the image of $w$ according to $f$. Let us consider the nodes
$w = w^i_1, w^i_2, \dots, w^i_{l_i}$ in $P$ associated to the path $v^i_1, v^i_2, \dots, v^i_{l_i}$. The first arc $(v^i_1, v^i_2)$ of the
path has been added due to one of the conditions (a) to (d). If it was added due to (a) then
there exists some predicate $R$ in $\tau$ and some $b, b' \in B$ such that $R^{\mathbf{B}} \cap (\{b\} \times B) = \{(b, b')\}$.
Furthermore $(w^i_1, w^i_2)$ belongs to $R^{\mathbf{P}}$. Consequently, if $f$ is a homomorphism it must map
$w^i_2$ to $b'$. We can reason similarly for the other conditions and for the remaining
nodes of the path until we reach the last element of the path, $v^i_{l_i}$. Here we should consider
two cases. If the arc $(v^i_{l_i - 1}, v^i_{l_i})$ has been added due to conditions (a) or (b), then we can
conclude that $w^i_{l_i}$ is $w$ and $f(w) = b_j$ for some $j \ne i$, obtaining a contradiction. Otherwise,
if the arc $(v^i_{l_i - 1}, v^i_{l_i})$ has been added due to conditions (c) or (d), then we can conclude that
$f$ is not a homomorphism, getting again a contradiction.

It only remains to show that $\mathbf{P}$ has pathwidth at most $(2, 3)$. The key observation here
is that each one of the paths $v^i_1, \dots, v^i_{l_i}$ gives rise to a collection of nodes $w^i_1, \dots, w^i_{l_i}$ of $\mathbf{P}$ such
that the restriction $\mathbf{P}|_{\{w^i_1, \dots, w^i_{l_i}\}}$ of $\mathbf{P}$ to $\{w^i_1, \dots, w^i_{l_i}\}$ has the following path-decomposition:

$$\{w^i_1, w^i_2\}, \dots, \{w^i_{l_i - 1}, w^i_{l_i}\}.$$

Consequently, since $w$ is the only node common to these
restrictions, we can easily conclude that $\mathbf{P}$ has the path-decomposition:

$$\{w, w^1_1, w^1_2\}, \{w, w^1_2, w^1_3\}, \dots, \{w, w^1_{l_1 - 1}, w^1_{l_1}\}, \{w, w^2_1, w^2_2\}, \dots, \{w, w^m_{l_m - 1}, w^m_{l_m}\}$$

□ 

Using a very similar line of reasoning it is possible to push these results a bit further.
In particular it is possible to show [7] that every structure $\mathbf{B}$ that contains only relations
invariant under an operation in the pseudovariety generated by all dual discriminator
operations also has bounded path duality. A complete presentation of this result would require
a lengthy introduction of the algebraic approach to CSP. Since this would lead us out of
the scope of this paper we omit it and refer instead to [7].

Another related family of constraint satisfaction problems solvable in NL is the class
of all $\mathrm{CSP}(\mathbf{B})$ where $\mathbf{B}$ is a poset with constants invariant under a near-unanimity
operation [28]. Again, for space limitations we shall not present this result. In [28] it is shown
that for every such poset $\mathbf{B}$, $\neg\mathrm{CSP}(\mathbf{B})$ is definable in pos FO+TC, that is, the fragment of
FO+TC in which negation and universal quantification are not allowed. It is known (folklore)
that pos FO+TC and linear Datalog have the same expressive power. Consequently, we
can infer that $\mathbf{B}$ has bounded path duality.

7.2. Implicative Hitting-Set Bounded. The class of Implicative Hitting-Set Bounded
was introduced in 0. Let $k$ be any integer greater than 1. A Boolean relation $R$ is in
$k$-IHS-B+ if it can be expressed as a CNF where each clause is of the form $\neg v$, $\neg v \vee w$
or $w_1 \vee \dots \vee w_k$ (here we do not require that all $w_i$'s are different). Similarly, $R$ is in
$k$-IHS-B$-$ if it can be expressed as a CNF where each clause is of the form $w$, $\neg v \vee w$ or
$\neg v_1 \vee \dots \vee \neg v_k$. A relational structure $\mathbf{B}$ is Implicative Hitting-Set Bounded if there exists
some $k \ge 2$ such that all its relations are in $k$-IHS-B+ or $k$-IHS-B$-$. It is well known
that for every Implicative Hitting-Set Bounded structure $\mathbf{B}$, $\mathrm{CSP}(\mathbf{B})$ is solvable in NL. We
shall prove that it has bounded path duality.

Lemma 4. Let $\mathbf{B}$ be a relational structure containing only relations in $k$-IHS-B+ for some
$k \ge 2$. Then $\mathbf{B}$ has $(k, k - 1 + \rho(\mathbf{B}))$-path duality. Similarly, if $\mathbf{B}$ contains only relations in
$k$-IHS-B$-$ then $\mathbf{B}$ has $(k, k - 1 + \rho(\mathbf{B}))$-path duality.
Proof. We shall consider only the case in which all relations of $\mathbf{B}$ are in $k$-IHS-B+. The
case $k$-IHS-B$-$ is completely symmetric. We need to introduce a bit of notation. Recall
that given a relation $T$ and a subset $I$ of $\{1, \dots, \rho(T)\}$, $T_{|I}$ denotes the restriction of $T$ to
$I$. We generalize this definition slightly by allowing sequences of integers instead of merely
sets. More formally, let $i_1, \dots, i_k$ be (not necessarily different) integers in $\{1, \dots, \rho(T)\}$. By
$T_{|i_1, \dots, i_k}$ we denote the $k$-ary relation

$$\{(a_{i_1}, \dots, a_{i_k}) : (a_1, \dots, a_{\rho(T)}) \in T\}$$

Let us start by introducing a property of unsatisfiable formulas which will be most
helpful in providing intuition for the proof.

Lemma 5. Let $\Phi$ be a CNF formula containing only clauses of the form $\neg v$, $\neg v \vee w$ or
$w_1 \vee \dots \vee w_k$. If $\Phi$ is unsatisfiable then there exists a clause of the form $w_1 \vee \dots \vee w_k$ such
that for each $i \in \{1, \dots, k\}$ there exists a sequence of variables $w_i = v^i_1, \dots, v^i_{l_i}$ such that
$(\neg v^i_{l_i})$ and $(\neg v^i_j \vee v^i_{j+1})$ for all $j \in \{1, \dots, l_i - 1\}$ are in $\Phi$.
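Lemma 5 suggests a simple reachability test, sketched below. The input encoding (a set of negated unit variables, a set of implication pairs `(v, w)` for clauses $\neg v \vee w$, and a list of positive clauses) and the name `unsat_witness` are assumptions made for this illustration. The sketch also relies on the easy converse of the lemma: a clause with the stated property cannot be satisfied.

```python
def unsat_witness(neg_units, implications, positive_clauses):
    """Return a positive clause witnessing unsatisfiability, or None.

    neg_units:        variables v with a unit clause (not v)
    implications:     pairs (v, w), one per clause (not v or w)
    positive_clauses: tuples (w_1, ..., w_k), one per positive clause
    """
    # Reverse adjacency: from w back to every v with (not v or w).
    pred = {}
    for v, w in implications:
        pred.setdefault(w, set()).add(v)
    # A variable is forced false if an implication chain leads from it
    # to some variable that carries a negated unit clause.
    forced_false = set(neg_units)
    stack = list(neg_units)
    while stack:
        w = stack.pop()
        for v in pred.get(w, ()):
            if v not in forced_false:
                forced_false.add(v)
                stack.append(v)
    for clause in positive_clauses:
        if all(w in forced_false for w in clause):
            return clause
    return None


# Hypothetical formula: (not c), (not a or b), (not b or c), (not d or c)
NEG = {"c"}
IMPL = {("a", "b"), ("b", "c"), ("d", "c")}
```

If no witnessing clause exists, setting exactly the forced-false variables to 0 and all others to 1 satisfies every clause, which is why returning `None` corresponds to satisfiability.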

In order to prove the theorem we need a generalization of this result. Let $\mathbf{A}$ be an
instance of $\mathrm{CSP}(\mathbf{B})$ and let $h$ be a partial mapping from the universe $A$ to $B$. In what
follows we shall denote a partial mapping as $(a_1 \to b_1, \dots, a_m \to b_m)$ where $\{a_1, \dots, a_m\}$ is
the domain of the partial mapping and for each $1 \le i \le m$, $a_i$ is mapped to $b_i$. We say that
a partial mapping is forbidden if there exists a predicate $R$ in the vocabulary $\tau$ of $\mathbf{B}$ and
some integers $i_1, \dots, i_m$ such that $(a_1, \dots, a_m) \in R^{\mathbf{A}}_{|i_1, \dots, i_m}$ and $(b_1, \dots, b_m) \notin R^{\mathbf{B}}_{|i_1, \dots, i_m}$.

Lemma 6. Let $\mathbf{A}$ be any unsatisfiable instance of $\mathrm{CSP}(\mathbf{B})$ where $\mathbf{B}$ contains only relations
in $k$-IHS-B+. Then there exist some elements $a^i_j$, $1 \le i \le k$, $1 \le j \le l_i$, of $A$ such that

$(a^1_1 \to 0, \dots, a^k_1 \to 0)$, $(a^i_{l_i} \to 1)$, $1 \le i \le k$, and $(a^i_j \to 1, a^i_{j+1} \to 0)$, $1 \le i \le k$, $1 \le j \le l_i - 1$,
are forbidden partial mappings.

Proof. The proof of this lemma is rather straightforward. We only present here a sketch
of the proof. In a first step, taking into account that every relation in $\mathbf{B}$ is in
$k$-IHS-B+, it is possible to construct from $\mathbf{A}$ and $\mathbf{B}$ an unsatisfiable CNF formula $\Phi$ in which the
variables are the elements of $A$ and every clause is of the form $\neg v$, $\neg v \vee w$, or $w_1 \vee \dots \vee w_k$.
Observe that we can associate to each clause $l_1 \vee \dots \vee l_r$ in $\Phi$ a forbidden partial mapping
$(v_1 \to b_1, \dots, v_r \to b_r)$ where $v_i$ is the variable underlying literal $l_i$ and $b_i$ is 0 if $l_i$ is positive
and 1 otherwise. Finally the result follows by applying Lemma 5 to formula $\Phi$. □

We are now in a position to construct a $\tau$-structure $\mathbf{P}$ of pathwidth at most $(k, k - 1 + \rho(\mathbf{B}))$ that is
homomorphic to $\mathbf{A}$ but not homomorphic to $\mathbf{B}$. The universe $P$ of $\mathbf{P}$ contains, for each
$1 \le i \le k$ and for each element $a^i_j$, $1 \le j \le l_i$, in the sequence guaranteed by the previous
lemma, an element $c^i_j$. If the same element appears more than once then we make different
copies. The universe of $\mathbf{P}$ also contains further elements that will be added as needed.

We now construct the relations of $\mathbf{P}$. At the same time we shall define a homomorphism
$h$ from $\mathbf{P}$ to $\mathbf{A}$. The intuition of the construction is very similar to the proof of Lemma 3.

Let $(a^1_1 \to 0, \dots, a^k_1 \to 0)$ be the first of the forbidden partial mappings guaranteed to
exist by Lemma 6. Consequently there exists some predicate symbol $R$ in $\tau$, some tuple
$(v_1, \dots, v_{\rho(R)}) \in R^{\mathbf{A}}$ and some $j_1, \dots, j_k \in \{1, \dots, \rho(R)\}$ such that $v_{j_1} = a^1_1, \dots, v_{j_k} = a^k_1$.
Then we include in $R^{\mathbf{P}}$ a tuple $(w_1, \dots, w_{\rho(R)})$ containing $c^i_1$ in its $j_i$-th position,
$i \in \{1, \dots, k\}$, and new elements not occurring in $\mathbf{P}$ elsewhere. We set $h(w_i)$ to be $v_i$. We
associate to the forbidden partial mapping $(a^1_1 \to 0, \dots, a^k_1 \to 0)$ a set $S_{(a^1_1 \to 0, \dots, a^k_1 \to 0)}$
that contains all elements $\{w_1, \dots, w_{\rho(R)}\}$ (we shall use this set later to define the path
decomposition of $\mathbf{P}$). We proceed in a similar fashion for each forbidden partial mapping
guaranteed by Lemma 6.

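By construction $h$ defines a homomorphism from $\mathbf{P}$ to $\mathbf{A}$. Furthermore $\mathbf{P}$ has
pathwidth at most $(k, k - 1 + \rho(\mathbf{B}))$, as certified by the following path-decomposition:

$$
\begin{gathered}
S_{(a^1_1 \to 0, \dots, a^k_1 \to 0)},\quad S_{(a^1_1 \to 1, a^1_2 \to 0)} \cup \{c^2_1, \dots, c^k_1\},\ \dots,\ S_{(a^1_{l_1 - 1} \to 1, a^1_{l_1} \to 0)} \cup \{c^2_1, \dots, c^k_1\}, \\
S_{(a^1_{l_1} \to 1)} \cup \{c^2_1, \dots, c^k_1\},\quad S_{(a^2_1 \to 1, a^2_2 \to 0)} \cup \{c^3_1, \dots, c^k_1\},\ \dots,\ S_{(a^k_{l_k - 1} \to 1, a^k_{l_k} \to 0)},\quad S_{(a^k_{l_k} \to 1)}
\end{gathered}
$$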

Finally, we shall now see that $\mathbf{P}$ is not homomorphic to $\mathbf{B}$. Towards a contradiction, let
$f$ be such a homomorphism. By the construction of $\mathbf{P}$ we can conclude that $(c^1_1 \to 0, \dots, c^k_1 \to 0)$
is a forbidden mapping of $\mathbf{P}$ and $\mathbf{B}$. Consequently, for some $i$, $f(c^i_1) = 1$. Consider
now the sequence $c^i_1, c^i_2, \dots, c^i_{l_i}$. By the construction of $\mathbf{P}$, for each $j \in \{1, \dots, l_i - 1\}$,
$(c^i_j \to 1, c^i_{j+1} \to 0)$ is a forbidden mapping. Consequently we can infer by induction that for
every $j \in \{1, \dots, l_i\}$, $f(c^i_j) = 1$. We then get a contradiction with the fact that $(c^i_{l_i} \to 1)$ is
a forbidden mapping.

□ 

7.3. New problems in NL. In this section we shall use the notion of bounded path duality as
a tool to identify some other constraint satisfaction problems in NL. We have mentioned
already in Section 7.1 that every structure invariant under an operation in the pseudovariety
generated by all dual discriminator operations has bounded path duality. More examples
of constraint satisfaction problems $\mathrm{CSP}(\mathbf{B})$ such that $\mathbf{B}$ has bounded path duality can
be found in the literature about H-coloring, which can be reformulated as the subcase of
the general constraint satisfaction problem $\mathrm{CSP}(\mathbf{H})$ when $\mathbf{H}$ is a (di)graph. As mentioned
in the introduction, the notion of bounded tree duality, which is intimately related to the
notion of path duality (and in fact inspired it), has been deeply investigated in the field of
H-coloring. Let us recall that a (di)graph $\mathbf{H}$ has bounded tree duality if $\mathrm{CSP}(\mathbf{H})$ has an
obstruction set containing only graphs of treewidth at most $k$ for some fixed $k$. Most of the
tractable cases of H-coloring possess bounded tree duality. Indeed, a closer inspection of
those results shows that, most of the time, the obstruction set corresponding to a given (di)graph
$\mathbf{H}$ contains only structures with bounded pathwidth. Consequently, as a direct consequence
of Proposition 3, we can lower the complexity of these problems from P to NL with virtually
no effort.

Let us now revisit some of these examples. For undirected graphs, the only case solvable
in polynomial time corresponds to bipartite graphs JH]. It is fairly easy to see that the
set of all odd cycles is an obstruction set of $\mathrm{CSP}(\mathbf{H})$ for any bipartite graph $\mathbf{H}$. As cycles
have pathwidth at most $(2, 3)$ we can conclude that every bipartite graph $\mathbf{H}$ has $(2, 3)$-path
duality. We have thus strengthened the dichotomy result of ^H]: for every undirected graph
$\mathbf{H}$, $\mathrm{CSP}(\mathbf{H})$ is in NL or NP-complete.
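The role of odd cycles as obstructions can be illustrated with a standard breadth-first 2-colouring. This is a sketch, not the NL procedure: it either returns a proper 2-colouring of the input graph (so the graph maps homomorphically to a single edge, and hence to any bipartite graph with at least one edge) or returns the vertices of an odd cycle. The function names are hypothetical.

```python
from collections import deque


def odd_cycle(parent, u, v):
    """Vertices of the cycle closed by edge (u, v) in the BFS tree."""
    path_u, x = [], u
    while x is not None:
        path_u.append(x)
        x = parent[x]
    index = {x: i for i, x in enumerate(path_u)}
    path_v, y = [], v
    while y not in index:  # climb from v to the common ancestor
        path_v.append(y)
        y = parent[y]
    return path_u[: index[y] + 1] + path_v[::-1]


def two_colour_or_odd_cycle(vertices, edges):
    """Return (colouring, None) or (None, odd cycle as a vertex list)."""
    adj = {v: set() for v in vertices}
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    colour, parent = {}, {}
    for s in vertices:
        if s in colour:
            continue
        colour[s], parent[s] = 0, None
        queue = deque([s])
        while queue:
            u = queue.popleft()
            for v in adj[u]:
                if v not in colour:
                    colour[v], parent[v] = 1 - colour[u], u
                    queue.append(v)
                elif colour[v] == colour[u]:
                    # Same colour means same BFS parity: the cycle is odd.
                    return None, odd_cycle(parent, u, v)
    return colour, None


C5 = [(i, (i + 1) % 5) for i in range(5)]
C4 = [(i, (i + 1) % 4) for i in range(4)]
```

The cycle returned for `C5` has odd length, and it maps homomorphically into the input graph, which is exactly what an obstruction is required to do.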

For directed graphs, it is a direct consequence of the results in ^3 EH (see also [17])
that every oriented path, directed cycle, and even more generally, every unbalanced oriented
cycle has bounded path duality.

Let us present these results. An oriented path is a digraph H with nodes $p_0, \dots, p_n$ such
that for each $i \in \{0, \dots, n-1\}$ either $(p_i, p_{i+1})$ or $(p_{i+1}, p_i)$ is an edge of H,
and H does not contain any other edge. It was shown in [18] that for every oriented path H,
CSP(H) has an obstruction set containing only oriented paths. It is easy to observe that the
Gaifman graph of an oriented path is a path and hence has pathwidth at most (1,2). Consequently,
oriented paths have (1,2)-path duality.

An oriented cycle is a digraph H with nodes $p_0, \dots, p_n$ such that for each $i \in \{0, \dots, n\}$,
either $(p_i, p_{i+1})$ or $(p_{i+1}, p_i)$ is an edge of H (the sum is modulo n + 1). If additionally all
edges are in the same direction then H is called a directed cycle. More generally, if the
number of forward edges is different from the number of backward edges then H is called
unbalanced.
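
The definition can be made concrete with a small Python sketch. The encoding is an assumption
made only for illustration: an oriented cycle on nodes 0, ..., n is given as a list of n + 1
signs, where entry i is +1 for a forward edge (from node i to node i + 1 modulo n + 1) and -1
for a backward edge. The function names are hypothetical.

```python
def _check(directions):
    """Validate the hypothetical sign encoding of an oriented cycle."""
    if not directions or any(d not in (1, -1) for d in directions):
        raise ValueError("expected a non-empty list of +1 / -1 entries")


def all_same_direction(directions):
    """True iff every edge of the oriented cycle points the same way."""
    _check(directions)
    return len(set(directions)) == 1


def is_unbalanced(directions):
    """True iff the number of forward edges differs from the number of backward edges."""
    _check(directions)
    forward = directions.count(1)
    backward = directions.count(-1)
    return forward != backward
```

Under this encoding, a cycle whose edges all point the same way is always unbalanced, since
one of the two counts is zero, whereas a cycle such as `[1, -1, 1, -1]` is balanced.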

In ^S], it is shown that for each unbalanced oriented cycle H, CSP(H) has an obstruction
set containing only oriented cycles. As oriented cycles have pathwidth at most (2, 3),
we can conclude that H has (2, 3)-path duality.
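
The pathwidth bounds used above can be checked on small instances. The sketch below assumes
the following reading of "pathwidth at most (j, k)": there is a sequence of bags such that
every edge lies inside some bag, the bags containing any given vertex are consecutive, every
bag has at most k elements, and consecutive bags share at most j elements. Edge orientations
are ignored, since only the Gaifman graph matters. All names are hypothetical.

```python
def path_bags(m):
    """Bags {i, i+1} for a path on vertices 0..m-1 (m >= 2)."""
    return [{i, i + 1} for i in range(m - 1)]


def cycle_bags(m):
    """Bags {0, i, i+1} for a cycle on vertices 0..m-1 (m >= 3)."""
    return [{0, i, i + 1} for i in range(1, m - 1)]


def width_of(bags, edges):
    """Validate a path decomposition and return its width as a pair (j, k).

    j is the largest intersection of two consecutive bags, k the largest bag size.
    """
    # Every edge must be contained in some bag.
    for u, v in edges:
        if not any(u in bag and v in bag for bag in bags):
            raise ValueError(f"edge {(u, v)} is not covered by any bag")
    # The bags containing a given vertex must form a contiguous interval.
    for v in set().union(*bags):
        positions = [i for i, bag in enumerate(bags) if v in bag]
        if positions[-1] - positions[0] + 1 != len(positions):
            raise ValueError(f"bags containing vertex {v} are not consecutive")
    k = max(len(bag) for bag in bags)
    j = max((len(a & b) for a, b in zip(bags, bags[1:])), default=0)
    return (j, k)
```

For instance, `width_of(cycle_bags(6), [(i, (i + 1) % 6) for i in range(6)])` evaluates to
`(2, 3)`, and `width_of(path_bags(4), [(0, 1), (2, 1), (2, 3)])` evaluates to `(1, 2)`.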

We conclude this section by examining the problem of deciding whether a given structure
B has bounded path duality. In general, this problem is not known to be decidable.
However, in a particular case it is possible to show decidability, by mimicking some results
originally proven in [12].

Theorem 7. For every $1 < k$, the problem of deciding whether a $\tau$-structure B has
(1, k)-path duality is decidable.

Proof. By Theorem 3 there exists a sentence $\varphi$ in 1-adic (also called monadic) monotone
SNP such that for every $\tau$-structure A, $A \models \varphi$ iff the Duplicator has a winning
strategy for the (1, k)-PR game on A and B. It is widely known (see [12]) that there exists a
sentence $\psi$ in monadic monotone SNP such that for every $\tau$-structure A, $A \models \psi$ iff
$A \in \mathrm{CSP}(B)$. Thus deciding (1, k)-path duality is equivalent to deciding whether
$(\varphi \Rightarrow \psi)$ is a tautology for finite structures. The latter is decidable since
containment for monadic monotone SNP is decidable [12]. □

8. Open Problems 

It is unknown whether bounded path duality captures the class of CSPs solvable in
NL, i.e., whether there exists a structure B that does not have bounded path duality such
that CSP(B) is solvable in NL. In this direction, it is not difficult to show, using some
results in [14], that for every structure B, if CSP(B) is in NL, then it can be expressed
in restricted Krom monadic SNP with equality, s(x,y) (true when y is the immediate 
successor of x in some total ordering), and max (first and last elements of the total 
ordering respectively). Here we understand that the homomorphism condition only applies 
to non-built-in predicates. 

Another interesting problem is to extend the class of structures B known to have
bounded path duality via closure conditions. In this respect, it would be interesting to
decide whether majority operations or, even more generally, near-unanimity operations [35]
(see also [?]) guarantee bounded path duality. This is true, in particular, for structures
with domain of size at most 2.